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RELATED APPLICATION 

5 This application claims the benefit of U.S. 

Provisional Patent Application No. 60/231,510 filed on 
September 9, 2000. 

FIELD OF THE INVENTION 

10 This invention relates generally to signal 

processors. More particularly, the invention relates to 
telephone signal processors and to voice activity 
detectors for integrated telecommunications processing. 

BACKGROUND OF THE INVENTION 

15 Single chip digital signal processing devices (DSP) 

are relatively well known. DSPs generally are 
distinguished from general purpose microprocessors in 
that DSPs typically support accelerated arithmetic 
operations by including a dedicated multiplier and 

20 accumulator (MAC) for performing multiplication of 

digital numbers. The instruction set for a typical DSP 

device usually includes a MAC instruction for performing 
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multiplication of new operands and addition with a prior 
accumulated value stored within an accumulator register. 

A MAC instruction is typically the only instruction 
provided in prior art digital signal processors where two 
5 DSP operations, multiply followed by add, are performed 
by the execution of one instruction. However, when 
performing signal processing functions on data it is 
often desirable to perform other DSP operations in 
varying combinations . 

10 An area where DSPs may be utilized is in 

telecommunication systems. One use of DSPs in 
telecommunication systems is digital filtering. In this 
case a DSP is typically programmed with instructions to 
implement some filter function in the digital or time 

15 domain. The mathematical algorithm for a typical finite 
impulse response (FIR) filter may look like the equation 
Y n = h 0 X 0 + hiXi + h 2 X 2 + . . . + h N X N where h n are fixed 
filter coefficients numbering from 1 to N and X n are the 
data samples. The equation Y n may be evaluated by using a 

20 software program. However in some applications, it is 
necessary that the equation be evaluated as fast as 
possible. One way to do this is to perform the 
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computations using hardware components such as a DSP 
device programmed to compute the equation Y n . In order to 
further speed the process, it is desirable to vectorize 
the equation and distribute the computation amongst 
5 multiple DSPs such that the final result is obtained more 
quickly. The multiple DSPs operate in parallel to speed 
the computation process. In this case, the multiplication 
of terms is spread across the multipliers of the DSPs 
equally for simultaneous computations of terms. The 

10 adding of terms is similarly spread equally across the 
adders of the DSPs for simultaneous computations. In 
vectorized processing, the order of processing terms is 
unimportant since the combination is associative. If the 
processing order of the terms is altered, it has no 

15 effect on the final result expected in a vectorized 
processing of a function. 

One area where finite impulse response filters is 
applied is in echo cancellation for telephony processing. 
Echo cancellation is used to cancel echoes over full 

20 duplex telephone communication channels. The echo- 
cancellation process isolates and filters the unwanted 
signals caused by echoes from the main transmitted signal 
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in a two-way transmission. Single or multiple DSP chips 
can be used to implement an echo canceller having finite 
impulse response filter to provide echo cancellation. 
However, echo cancellation is only one part of 
telecommunication processing. Typically, telephone 
processing functions are spread over multiple devices, 
components or boards in a telephone communication system. 

Referring now to Figure 8, a typical prior art 
telephone communication system is illustrated. A 
telephone, fax, or data modem couples to a local 
subscriber loop 802 at one end and another local 
subscriber loop 802' at an opposite end. Each of the 
local subscriber loops 802 and 802' couple to 2 -wire/4 - 
wire hybrid circuits 804 and 804' . Hybrid circuits 804 
are composed of resistor networks, capacitors, and 
ferrite-core transformers. Hybrids circuits 804 convert 
4 -wire telephone trunk lines 806 (a pair in each 
direction) running between telephone exchanges of the 
PSTN 812 to each of the 2-wire local subscriber loops 802 
and 8 02' . The hybrid circuits 8 04 is intended to direct 
all the energy from a talker on the 4 -wire trunk 8 06 at a 
far-end to a listener on a 2-wire local subscriber loop 
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8 02 at a near end. 

Echoes 810' are often formed when a speech signal 
from a far end talker leaves a far end hybrid 804' on a 
pair of the four wires 806' , and arrives at the near end 
5 after traversing the PSTN 812, and may be heard by the 
listener at the near side. In traditional telephone 
networks, an echo canceller is placed at each end of the 
PSTN in order to reduce and attempt to eliminate this 
echo . 

10 Referring now to Figure 9, a typical prior art 

digital echo canceller 900 is illustrated. The prior art 
digital echo canceller 90 0 couples between the hybrid 
circuit 804 and the public switched telephone network 
(PSTN) 902 on the telephone trunk lines. The governing 

15 specification for digital echo cancellers is the ITU-T 
recommendation G.168, Digital network echo cancellers. 
The following terms from ITU-T document G.168 are used 
herein and are illustrated in Figure 9. The end or side 
of the connection towards the local handset is referred 

20 to as the near end, near side or send side 910. The end 
or side of the connection towards the distant handset is 
referred to as the far end, far side or receive side 920. 
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The part of the circuit from the near end 910 to the far 
end 920 is the send path 930. The part of the circuit 
from the far end to the near end is the receive path 93 5. 
The part of the circuit (i.e. copper wire, hybrid) in the 
5 local loop 8 02, between the end system subscriber or 

telephone system 108 and the central -of f ice termination 
of the hybrid 804, is the end path. Speech signals 
entering the echo canceller 900 from the near end 910 are 
the send input S in . Speech signals entering the echo 

10 canceller from the far end 92 0 are the received input R in . 
Speech signals output from the echo canceller 90 0 to the 
far end 920 are the send output S out . Speech signals 
exiting the echo canceller to the near end 910 are the 
received output R ou t - 

15 The typical prior art digital echo canceller 900 

includes the basic components of an echo estimator 902, a 
digital subtractor 904, and a non-linear processor 906. 
Typically, the echo-cancellation process in the typical 
prior art digital echo canceller 900 begins by 

20 eliminating impedance mismatches. In order to do so, the 
typical digital echo canceller 900 taps the receive-side 
input signal (Rin) - Rin is processed to generate an 
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estimate of Sin in the echo estimator (902) . Sin serves 
as the reference signal for the echo cancellation 
process. Rin is also passed through to the near end 910 
without change as the R ou t signal. The echo estimator 902 
is a linear finite impulse response (FIR) convolution 
filter implemented in a DSP. The estimator 902 accepts 
successive samples of voice on Rin (typically a 16 bit 
sample every 12 5 microseconds) . The voice samples are 
multiplied with a set of filter coefficients 
approximating the impulse response of circuitry in the 
endpath to generate an echo estimation. Over time, the 
set of filter coefficients are changed (i.e. adapted) 
until they accurately represent the desired impulse 
response to form an accurate echo estimation. The echo 
estimation is coupled into the subtractor 904. If the 
echo estimation is accurate, it is substantially 
equivalent to the actual echo on S in and the output from 
the subtractor 906 into the non- linear processor has 
linear echoes substantially removed. The non-linear 
processor 906 is used to remove non-linear echo sources. 

With growing interest in providing telephony 
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communication channels over packet networks such as the 
Internet or Asynchronous Transfer Mode (ATM) , telephony 
processing has become more complicated. 



042390 -P12534 

Express Mail: EL802874811US 



9 



BRIEF DESCRIPTIONS OF THE DRAWINGS 

Figure 1A is a block diagram of a system utilizing 
the present invention. 

Figure IB is a block diagram of a printed circuit 
5 board utilizing the present invention within the gateways 
of the system in Figure 1A. 

Figure 2 is a block diagram of the Application 
Specific Signal Processor (ASSP) of the present 
invention . 

10 Figure 3 is a block diagram of an instance of the 

core processors within the ASSP of the present invention. 

Figure 4 is a block diagram of the RISC processing 
unit within the core processors of Figure 3 . 

Figure 5A is a block diagram of an instance of the 
15 signal processing units within the core processors of 
Figure 3 . 

Figure 5B is a more detailed block diagram of Figure 
5A illustrating the bus structure of the signal 
processing unit . 
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Figure 6A is an exemplary instruction sequence 
illustrating a program model for DSP algorithms employing 
the instruction set architecture of the present 
invention . 

Figure 6B is a chart illustrating the permutations 
of the dyadic DSP instructions. 

Figure 6C is an exemplary bitmap for a control 
extended dyadic DSP instruction. 

Figure 6D is an exemplary bitmap for a non- extended 
dyadic DSP instruction. 

Figure 6E and 6F list the set of 20-bit instructions 
for the ISA of the present invention. 

Figure 6G lists the set of extended control 
instructions for the ISA of the present invention. 

Figure 6H lists the set of 40-bit DSP instructions 
for the ISA of the present invention. 

Figure 61 lists the set of addressing instructions 
for the ISA of the present invention. 
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Figure 7 is a block diagram illustrating the 
instruction decoding and configuration of the functional 
blocks of the signal processing units. 

Figure 8 is a prior art block diagram illustrating a 
5 PSTN telephone network and echoes therein. 

Figure 9 is a prior art block diagram illustrating a 
typical prior art echo canceller for a PSTN telephone 
network. 

Figure 10 is a block diagram of a packet network 
10 system incorporating the integrated telecommunications 
processor of the present invention. 

Figure 11 is a block diagram of the firmware 
telecommunication processing modules of the integrated 
telecommunications processor for one of multiple full 
15 duplex channels. 

Figure 12 is a flow chart of telecommunication 
processing from the near end to the packet network. 

Figure 13 is a flow chart of the telecommunication 
processing of a packet from the network into the 
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integrated telecommunications processor into TDM signals 
at the near end. 

Figure 14A is a block diagram of the data flows and 
interaction between exemplary functional blocks of the 
integrated telecommunications processor 150 for telephony 
processing . 

Figure 14B is a flow chart of an algorithm for 
performing voice activity detection. 

Figure 14C is a flow chart of an algorithm for fast 
Fourier transform (FFT) processing of input speech for 
voice activity detection. 

Figure 14D is a flow chart for zero crossing 
detection for voice activity detection. 

Figure 14E is a flow chart of a process for noise 
detection for voice activity detection. 

Figure 14F is a flow chart of a process for energy 
discrimination for voice activity detection. 

Figure 14G is a flow chart of a process for 
instantaneous energy discrimination for voice activity 

042390. P12534 13 
Express Mail: EL802874811US 



detection . 



Figure 15 is a block diagram of exemplary memory 
maps into the memories of the integrated 
telecommunications processor 150. 

Figure 16 is a block diagram of an exemplary memory 
map for the global buffer memory of the integrated 
telecommunications processor 150. 

Figure 17 is an exemplary time line diagram of 
reception and processing time for frames of data. 

Figure 18 is an exemplary time line diagram of how 
core processors of the integrated telecommunications 
processor 150 process frames of data for multiple 
communication channels . 

Like reference numbers and designations in the 
drawings indicate like elements providing similar 
functionality. A letter or prime after a reference 
designator number represents an instance of an element 
having the reference designator number. 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT 



In the following detailed description of the present 
invention, numerous specific details are set forth in 
order to provide a thorough understanding of the present 
invention. However, it will be obvious to one skilled in 
the art that the present invention may be practiced 
without these specific details. In other instances well 
known methods, procedures, components, and circuits have 
not been described in detail so as not to unnecessarily 
obscure aspects of the present invention. Furthermore, 
the present invention will be described in particular 
embodiments but may be implemented in hardware, software, 
firmware or a combination thereof. 

Multiple application specific signal processors 
(ASSPs) having the instruction set architecture of the 
present invention, including dyadic DSP instructions, are 
provided within gateways in communication systems to 
provide improved voice and data communication over a 
packetized network. Each ASSP includes a serial 
interface, a host interface, a buffer memory and four 
core processors in order to simultaneously process 
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multiple channels of voice or data. Each core processor 
preferably includes a reduced instruction set computer 
(RISC) processor and four signal processing units (SPs) . 
Each SP includes multiple arithmetic blocks to 
simultaneously process multiple voice and data 
communication signal samples for communication over IP, 
ATM, Frame Relay, or other packetized network. The four 
signal processing units can execute digital signal 
processing algorithms in parallel. Each ASSP is flexible 
and can be programmed to perform many network functions 
or data/voice processing functions, including voice and 
data compression/ decompression in telecommunication 
systems (such as CODECs) , particularly packetized 
telecommunication networks, simply by altering the 
software program controlling the commands executed by the 
ASSP . 

An instruction set architecture for the ASSP is 
tailored to digital signal processing applications 
including audio and speech processing such as 
compression/ decompression and echo cancellation. The 
instruction set architecture implemented with the ASSP, 
is adapted to DSP algorithmic structures. This 
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adaptation of the ISA of the present invention to DSP 
algorithmic structures balances the ease of 
implementation, processing efficiency, and 
programmability of DSP algorithms. _The instruction set 
architecture may be viewed as being two component parts, 
one (RISC ISA) corresponding to the RISC control unit and 
another (DSP ISA) to the DSP datapaths of the signal 
processing units 300. The RISC ISA is a register based 
architecture including 16-registers within the register 
file 413, while the DSP ISA is a memory based 
architecture with efficient digital signal processing 
instructions. The instruction word for the ASSP is 
typically 2 0 bits but can be expanded to 4 0 -bits to 
control two instructions to the executed in series or 
parallel, such as two RISC control instruction and 
extended DSP instructions. The instruction set 
architecture of the ASSP has four distinct types of 
instructions to optimize the DSP operational mix. These 
are (1) a 2 0 -bit DSP instruction that uses mode bits in 
control registers (i.e. mode registers), (2) a 40-bit DSP 
instruction having control extensions that can override 
mode registers, (3) a 20-bit dyadic DSP instruction, and 
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(4) a 40 bit dyadic DSP instruction. These instructions 
are for accelerating calculations within the core 
processor of the type where D = [ (A opl B) op2 C ] and 
each of "opl" and "op2" can be a multiply, add or 
extremum (min/max) class of operation on the three 
operands A, B , and C. The ISA of the ASSP which 
accelerates these calculations allows efficient chaining 
of different combinations of operations. 

All DSP instructions of the instruction set 
architecture of the ASSP are dyadic DSP instructions to 
execute two operations in one instruction with one cycle 
throughput. A dyadic DSP instruction is a combination of 
two DSP instructions or operations in one instruction and 
includes a main DSP operation (MAIN OP) and a sub DSP 
operation (SUB OP) . Generally, the instruction set 
architecture of the present invention can be generalized 
to combining any pair of basic DSP operations to provide 
very powerful dyadic instruction combinations. The DSP 
arithmetic operations in the preferred embodiment include 
a multiply instruction (MULT) , an addition instruction 
(ADD) , a minimize/maximize instruction (MIN/MAX) also 
referred to as an extrema instruction, and a no operation 
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instruction (NOP) each having an associated operation 
code ("opcode"). 

The present invention efficiently executes these 
dyadic DSP instructions by means of the instruction set 
architecture and the hardware architecture of the 
application specific signal processor. 

Moreover, in one embodiment of the present 
invention, an integrated voice activation detector 
detects whether voice is present. The integrated voice 
activation detector includes a semiconductor integrated 
circuit having at least one signal processing unit to 
perform voice detection and a storage device to store 
signal processing instructions for execution by the at 
least one signal processing unit to: detect whether noise 
is present to determine whether a noise flag should be 
set, detect a predetermined number of zero crossings to 
determine whether a zero crossing flag should be set, 
detect whether a threshold amount of energy is present to 
determine whether an energy flag should be set, and 
detect whether instantaneous energy is present to 
determine whether an instantaneous energy flag should be 
set. Utilizing a combination of the noise, zero 
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crossing, energy, and instantaneous energy flags the 
integrated voice activation detector determines whether 
voice is present. 

Referring now to Figure 1A, a voice and data 
communication system 100 is illustrated. The system 100 
includes a network 101 which is a packetized or packet- 
switched network, such as IP, ATM, or frame relay. The 
network 101 allows the communication of voice/speech and 
data between endpoints in the system 100, using packets. 

Data may be of any type including audio, video, email, 
and other generic forms of data. At each end of the 
system 100, the voice or data requires packetizat ion when 
transceived across the network 101. The system 100 
includes gateways 104A and 104B in order to packet ize the 
information received for transmission across the network 
101. A gateway is a device for connecting multiple 
networks and devices that use different protocols. Voice 
and data information may be provided to a gateway 104 
from a number of different sources in a variety of 
digital formats. In system 100, analog voice signals are 
transceived by a telephone 108. In system 100, digital 
voice signals are transceived at public branch exchanges 
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(PBX) 112A and 112B which are coupled to multiple 
telephones, fax machines, or data modems. Digital voice 
signals are transceived between PBX 112A and PBX 112B 
with gateways 104A and 104B, respectively over the packet 
network 101. Digital data signals may also be 
transceived directly between a digital modem 114 and a 
gateway 104A. Digital modem 114 may be a Digital 
Subscriber Line (DSL) modem or a cable modem. Data 
signals may also be coupled into system 100 by a wireless 
communication system by means of a mobile unit 118 
transceiving digital signals or analog signals wirelessly 
to a base station 116. Base station 116 converts analog 
signals into digital signals or directly passes the 
digital signals to gateway 104B. Data may be transceived 
by means of modem signals over the plain old telephone 
system (POTS) 107B using a modem 110. Modem signals 
communicated over POTS 107B are traditionally analog in 
nature and are coupled into a switch 10 6B of the public 
switched telephone network (PSTN) . At the switch 106B, 
analog signals from the POTS 107B are digitized and 
transceived to the gateway 104B by time division 
multiplexing (TDM) with each time slot representing a 
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channel and one DSO input to gateway 104B. At each of 
the gateways 104A and 104B, incoming signals are 
packetized for transmission across the network 101. 
Signals received by the gateways 104A and 104B from the 
network 101 are depacketized and transcoded for 
distribution to the appropriate destination. 

Referring now to Figure IB, a network interface card 
(NIC) 130 of a gateway 104 is illustrated. The NIC 130 
includes one or more application-specific signal 
processors (ASSPs) 150A-150N. The number of ASSPs within 
a gateway is expandable to handle additional channels. 
Line interface devices 131 of NIC 130 provide interfaces 
to various devices connected to the gateway, including 
the network 101. In interfacing to the network 101, the 
line interface devices packetize data for transmission 
out on the network 101 and depacketize data which is to 
be received by the ASSP devices. Line interface devices 
131 process information received by the gateway on the 
receive bus 134 and provides it to the ASSP devices. 
Information from the ASSP devices 150 is communicated on 
the transmit bus 13 2 for transmission out of the gateway. 
A traditional line interface device is a multi-channel 
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serial interface or a UTOPIA device. The NIC 13 0 couples 
to a gateway backplane/network interface bus 136 within 
the gateway 104. Bridge logic 138 transceives 
information between bus 136 and NIC 130. Bridge logic 
138 transceives signals between the NIC 13 0 and the 
backplane/network interface bus 13 6 onto the host bus 13 9 
for communication to either one or more of the ASSP 
devices 150A-150N, a host processor 140, or a host memory 
142. Optionally coupled to each of the one or more ASSP 
devices 150A through 150N (generally referred to as ASSP 
150) are optional local memory 145A through 145N 
(generally referred to as optional local memory 145) , 
respectively. Digital data on the receive bus 134 and 
transmit bus 132 is preferably communicated in bit wide 
fashion. While internal memory within each ASSP may be 
sufficiently large to be used as a scratchpad memory, 
optional local memory 145 may be used by each of the 
ASSPs 150 if additional memory space is necessary. 

Each of the ASSPs 150 provide signal processing 
capability for the gateway. The type of signal 
processing provided is flexible because each ASSP may 
execute differing signal processing programs. Typical 
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signal processing and related voice packetization 
functions for an ASSP include (a) echo cancellation; (b) 
video, audio, and voice/speech compression/decompression 
(voice/speech coding and decoding) ; (c) delay handling 
(packets, frames) ; (d) loss handling; (e) connectivity 
(LAN and WAN) ; (f ) security (encrypt ion/ decrypt ion) ; (g) 
telephone connectivity; (h) protocol processing 
(reservation and transport protocols, RSVP, TCP/IP, RTP, 
UDP for IP, and AAL2 , AAL1 , AAL5 for ATM); (i) filtering; 
(j) Silence suppression; (k) length handling (frames, 
packets) ; and other digital signal processing functions 
associated with the communication of voice and data over 
a communication system. Each ASSP 15 0 can perform other 
functions in order to transmit voice and data to the 
various endpoints of the system 10 0 within a packet data 
stream over a packet i zed network. 

Referring now to Figure 2, a block diagram of the 
ASSP 150 is illustrated. At the heart of the ASSP 150 
are four core processors 200A-200D. Each of the core 
processors 2 00A-200D is respectively coupled to a data 
memory 2 02A-2 02D and a program memory 2 04A-2 04D. Each of 
the core processors 200A-200D communicates with outside 
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channels through the multi-channel serial interface 206, 
the multi-channel memory movement engine 2 08, buffer 
memory 210, and data memory 202A-202D. The ASSP 150 
further includes an external memory interface 212 to 
couple to the external optional local memory 145. The 
ASSP 150 includes an external host interface 214 for 
interfacing to the external host processor 140 of Figure 
IB. - Further included within the ASSP 150 are timers 
216, clock generators and a phase-lock loop 218, 
miscellaneous control logic 220, and a Joint Test Action 
Group (JTAG) test access port 222 for boundary scan 
testing. The multi-channel serial interface 206 may be 
replaced with a UTOPIA parallel interface for some 
applications such as ATM. The ASSP 150 further includes a 
microcontroller 223 to perform process scheduling for the 
core processors 200A-200D and the coordination of the 
data movement within the ASSP as well as an interrupt 
controller 2 24 to assist in interrupt handling and the 
control of the ASSP 150. 

Referring now to Figure 3, a block diagram of the 
core processor 200 is illustrated coupled to its 
respective data memory 202 and program memory 204. Core 
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processor 200 is the block diagram for each of the core 
processors 2 0 0A-200D. Data memory 202 and program memory 
2 04 refers to a respective instance of data memory 2 02A- 
202D and program memory 204A-204D, respectively. The 
core processor 200 includes four signal processing units 
SP0 300A, SP1 300B, SP2 300C and SP3 300D. The core 
processor 200 further includes a reduced instruction set 
computer (RISC) control unit 3 02 and a pipeline control 
unit 304. The signal processing units 300A-300D perform 
the signal processing tasks on data while the RISC 
control unit 302 and the pipeline control unit 304 
perform control tasks related to the signal processing 
function performed by the SPs 300A-300D. The control 
provided by the RISC control unit 3 02 is coupled with the 
SPs 300A-300D at the pipeline level to yield a tightly 
integrated core processor 2 00 that keeps the utilization 
of the signal processing units 300 at a very high level. 

The signal processing tasks are performed on the 
datapaths within the signal processing units 300A-300D. 
The nature of the DSP algorithms are such that they are 
inherently vector operations on streams of data, that 
have minimal temporal locality {data reuse) . Hence, a 
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data cache with demand paging is not used because it 
would not function well and would degrade operational 
performance. Therefore, the signal processing units 
300A-300D are allowed to access vector elements (the 
5 operands) directly from data memory 2 02 without the 
overhead of issuing a number of load and store 
instructions into memory resulting, in very efficient 
data processing. Thus, the instruction set architecture 
of the present invention having a 20 bit instruction word 

10 which can be expanded to a 40 bit instruction word, 
achieves better efficiencies than VLIW architectures 
using 256-bits or higher instruction widths by adapting 
the ISA to DSP algorithmic structures. The adapted ISA 
leads to very compact and low-power hardware that can 

15 scale to higher computational requirements. The operands 
that the ASSP can accommodate are varied in data type and 
data size. The data type may be real or complex, an 
integer value or a fractional value, with vectors having 
multiple elements of different sizes. The data size in 

2 0 the preferred embodiment is 64 bits but larger data sizes 
can be accommodated with proper instruction coding. 

Referring now to Figure 4, a detailed block diagram 
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of the RISC control unit 302 is illustrated. RISC 
control unit 3 02 includes a data aligner and formatter 
402, a memory address generator 404, three adders 406A- 
406C, an arithmetic logic unit (ALU) 408, a multiplier 
410, a barrel shifter 412, and a register file 413. The 
register file 413 points to a starting memory location 
from which memory address generator 4 04 can generate 
addresses into data memory 202. The RISC control unit 
302 is responsible for supplying addresses to data memory 
so that the proper data stream is fed to the signal 
processing units 300A-300D. The RISC control unit 302 is 
a register to register organization with load and store 
instructions to move data to and from data memory 202. 
Data memory addressing is performed by RISC control unit 
using a 32-bit register as a pointer that specifies the 
address, post-modification offset, and type and permute 
fields. The type field allows a variety of natural DSP 
data to be supported as a "first class citizen" in the 
architecture. For instance, the complex type allows 
direct operations on complex data stored in memory 
removing a number of bookkeeping instructions. This is 
useful in supporting QAM demodulators in data modems very 
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efficiently. 

Referring now to Figure 5A, a block diagram of a 
signal processing unit 300 is illustrated which 
represents an instance of the SPs 300A-300D. Each of the 
signal processing units 3 00 includes a data typer and 
aligner 502, a first multiplier Ml 504A, a compressor 
506, a first adder Al 51 OA, a second adder A2 51 0B, an 
accumulator register 512, a third adder A3 510C, and a 
second multiplier M2 504B. Adders 510A-510C are similar 
in structure and are generally referred to as adder 510. 

Multipliers 5 04A and 504B are similar in structure and 
generally referred to as multiplier 504. Each of the 
multipliers 504A and 504B have a multiplexer 514A and 
514B respectively at its input stage to multiplex 
different inputs from different busses into the 
multipliers. Each of the adders 510A, 510B, 510C also 
have a multiplexer 520A, 520B, and 520C respectively at 
its input stage to multiplex different inputs from 
different busses into the adders. These multiplexers and 
other control logic allow the adders, multipliers and 
other components within the signal processing units 3 0 0A- 
3 00C to be flexibly interconnected by proper selection of 
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multiplexers. In the preferred embodiment, multiplier Ml 
504A, compressor 506, adder Al 510A, adder A2 510B and 
accumulator 512 can receive inputs directly from external 
data buses through the data typer and aligner 502. In 
5 the preferred embodiment, adder 510C and multiplier M2 
504B receive inputs from the accumulator 512 or the 
outputs from the execution units multiplier Ml 504A, 
compressor 506, adder Al 510A, and adder A2 510B. 

Program memory 2 04 couples to the pipe control 3 04 

10 which includes an instruction buffer that acts as a local 
loop cache. The instruction buffer in the preferred 
embodiment has the capability of holding four 
instructions. The instruction buffer of the pipe control 
304 reduces the power consumed in accessing the main 

15 memories to fetch instructions during the execution of 
program loops. 

Referring now to Figure 5B, a more detailed block 
diagram of the functional blocks and the bus structure of 
the signal processing unit is illustrated. Dyadic DSP 
2 0 instructions are possible because of the structure and 
functionality provided in each signal processing unit. 
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Output signals are coupled out of the signal processor 
300 on the Z output bus 53 2 through the data typer and 
aligner 502. Input signals are coupled into the signal 
processor 300 on the X input bus 531 and Y input bus 533 
through the data typer and aligner 502. Internally, the 
data typer and aligner 502 has a different data bus to 
couple to each of multiplier Ml 504A, compressor 506, 
adder Al 510A, adder A2 510B, and accumulator register AR 
512. While the data typer and aligner 502 could have 
data busses coupling to the adder A3 510C and the 
multiplier M2 504B, in the preferred embodiment it does 
not in order to avoid extra data lines and conserve area 
usage of an integrated circuit. Output data is coupled 
from the accumulator register AR 512 into the data typer 
and aligner 502. Multiplier Ml 504A has buses to couple 
its output into the inputs of the compressor 506, adder 
Al 510A, adder A2 510B, and the accumulator registers AR 
512. Compressor 506 has buses to couple its output into 
the inputs of adder Al 510A and adder A2 510B. Adder Al 
51 OA has a bus to couple its output into the accumulator 
registers 512. Adder A2 510B has buses to couple its 
output into the accumulator registers 512. Accumulator 
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registers 512 has buses to couple its output into 
multiplier M2 504B, adder A3 510C, and data typer and 
aligner 502. Adder A3 510C has buses to couple its 
output into the multiplier M2 504B and the accumulator 
5 registers 512. Multiplier M2 504B has buses to couple 
its output into the inputs of the adder A3 510C and the 
accumulator registers AR 512. 



INSTRUCTION SET ARCHITECTURE 
The instruction set architecture of the ASSP 150 is 
tailored to digital signal processing applications 
including audio and speech processing such as 
compression/decompression and echo cancellation. In 
essence, the instruction set architecture implemented 
with the ASSP 150, is adapted to DSP algorithmic 
structures. The adaptation of the ISA of the present 
invention to DSP algorithmic structures is a balance 
between ease of implementation, processing efficiency, 
and programmability of DSP algorithms. The ISA of the 
present invention provides for data movement operations, 
DSP/arithmetic/logical operations, program control 
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operations (such as function calls/returns, 
unconditional/conditional jumps and branches) , and system 
operations (such as privilege, interrupt/trap/hazard 
handling and memory management control) . 

5 Referring now to Figure 6A, an exemplary instruction 

sequence 600 is illustrated for a DSP algorithm program 
model employing the instruction set architecture of the 
present invention. The instruction sequence 600 has an 
outer loop 601 and an inner loop 602. Because DSP 

10 algorithms tend to perform repetitive computations, 

instructions 605 within the inner loop 602 are executed 
more often than others. Instructions 603 are typically 
parameter setup code to set the memory pointers, provide 
for the setup of the outer loop 601, and other 2X2 0 

15 control instructions. Instructions 607 are typically 
context save and function return instructions or other 
2X20 control instructions. Instructions 603 and 607 are 
often considered overhead instructions which are 
typically infrequently executed. Instructions 604 are 

20 typically to provide the setup for the inner loop 602, 
other control through 2x2 0 control instructions, or 
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offset extensions for pointer backup. Instructions 606 
typically provide tear down of the inner loop 602, other 
control through. 2x2 0 control instructions, and combining 
of datapath results within the signal processing units. 
5 Instructions 605 within the inner loop 602 typically 

provide inner loop execution of DSP operations, control 
of the four signal processing units 3 00 in a single 
instruction multiple data execution mode, memory access 
for operands, dyadic DSP operations, and other DSP 

10 functionality through the 20/40 bit DSP instructions of 
the ISA of the present invention. Because instructions 
605 are so often repeated, significant improvement in 
operational efficiency may be had by providing the DSP 
instructions, including general dyadic instructions and 

15 dyadic DSP instructions, within the ISA of the present 
invention . 

The instruction set architecture of the ASSP 150 can 
be viewed as being two component parts, one (RISC ISA) 
corresponding to the RISC control unit and another (DSP 
2 0 ISA) to the DSP datapaths of the signal processing units 
300. The RISC ISA is a register based architecture 
including sixteen registers within the register file 413, 
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while the DSP ISA is a memory based architecture with 
efficient digital signal processing instructions. The 
instruction word for the ASSP is typically 20 bits but 
can be expanded to 40-bits to control two RISC or DSP 
5 instructions to be executed in series or parallel, such 
as a RISC control instruction executed in parallel with a 
DSP instruction, or a 40 bit extended RISC or DSP 
instruction . 

The instruction set architecture of the ASSP 150 has 
10 4 distinct types of instructions to optimize the DSP 

operational mix. These are (1) a 20-bit DSP instruction 
that uses mode bits in control registers (i.e. mode 
registers), (2) a 40-bit DSP instruction having control 
extensions that can override mode registers, (3) a 20-bit 
15 dyadic DSP instruction, and (4) a 40 bit dyadic DSP 

instruction. These instructions are for accelerating 
calculations within the core processor 200 of the type 
where D = [ (A opl B) op2 C ] and each of "opl" and "op2" 
can be a multiply, add or extremum (min/max) class of 
2 0 operation on the three operands A, B, and C. The ISA of 
the ASSP 150 which accelerates these calculations allows 
efficient chaining of different combinations of 
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operations. Because these type of operations require 
three operands, they must be available to the processor. 

However, because the device size places limits on the 
bus structure, bandwidth is limited to two vector reads 
5 and one vector write each cycle into and out of data 
memory 202. Thus one of the operands, such as B or C, 
needs to come from another source within the core 
processor 200. The third operand can be placed into one 
of the registers of the accumulator 512 or the RISC 

10 register file 413 . In order to accomplish this within 
the core processor 200 there are two subclasses of the 
20-bit DSP instructions which are (1) A and B specified 
by a 4-bit specifier, and C and D by a 1-bit specifier 
and (2) A and C specified by a 4 -bit specifier, and B and 

15 D by a 1 bit specifier. 

Instructions for the ASSP are always fetched 4 0 -bits 
at a time from program memory with bit 3 9 and 19 
indicating the type of instruction. After fetching, the 
instruction is grouped into two sections of 2 0 bits each 

20 for execution of operations. In the case of 20-bit 

control instructions with parallel execution (bit 39=0, 
bit 19=0) , the two 20-bit sections are control 
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instructions that are executed simultaneously. In the 
case of 2 0 -bit control instructions for serial execution 
(bit 39=0, bit 19=1), the two 20-bit sections are control 
instructions that are executed serially. In the case of 
5 20-bit DSP instructions for serial execution (bit 39=1, 
bit 19=1) , the two 20-bit sections are DSP instructions 
that are executed serially. In the case of 40-bit DSP 
instructions (bit 39=1, bit 19 = 0) , the two 20 bit 
sections form one extended DSP instruction which are 

10 executed simultaneously. 

The ISA of the ASSP 150 is fully predicated 
providing for execution prediction. Within the 20-bit 
RISC control instruction word and the 4 0 -bit extended DSP 
instruction word there are 2 bits of each instruction 

15 specifying one of four predicate registers within the 

RISC control unit 302. Depending upon the condition of 
the predicate register, instruction execution can 
conditionally change base on its contents. 

In order to access operands within the data memory 

20 202 or registers within the accumulator 512 or register 
file 413, a 6-bit specifier is used in the DSP extended 
instructions to access operands in memory and registers. 
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Of the six bit specifier used in the extended. DSP 
instructions, the MSB (Bit 5) indicates whether the 
access is a memory access or register access. In the 
preferred embodiment, if Bit 5 is set to logical one, it 
5 denotes a memory access for an operand. If Bit 5 is set 
to a logical zero, it denotes a register access for an 
operand. If Bit 5 is set to 1, the contents of a 
specified register (rX where X: 0-7) are used to obtain 
the effective memory address and post -modify the pointer 

10 field by one of two possible offsets specified in one of 
the specified rX registers. If Bit 5 is set to 0, Bit 4 
determines what register set has the contents of the 
desired operand. If Bit-4 is set to 0, then the 
remaining specified bits 3:0 control access to the 

15 registers within the register file 413 or to registers 
within the signal processing units 300. 



DSP INSTRUCTIONS 
There are four major classes of DSP instructions for 
20 the ASSP 150 these are : 

1) Multiply (MULT) : Controls the execution of the main 
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multiplier connected to data buses from memory. 
Controls: Rounding, sign of multiply 

Operates on vector data specified through type field in 
address register 
5 Second operation: Add, Sub, Min, Max in vector or scalar 
mode 

2) Add (ADD) : Controls the execution of the main-adder 
Controls: absolute value control of the inputs, limiting 

10 the result 

Second operation: Add, add-sub, mult, mac, min, max 

3) Extremum (MIN/MAX) : Controls the execution of the 
ma in -adder 

15 Controls: absolute value control of the inputs, Global 

or running max/min with T register, TR register recording 
control 

Second operation: add, sub, mult, mac, min, max 
2 0 4) Misc: type-match and permute operations. 

The ASSP 15 0 can execute these DSP arithmetic 
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operations in vector or scalar fashion. In scalar 
execution, a reduction or combining operation is 
performed on the vector results to yield a scalar result. 
It is common in DSP applications to perform scalar 
5 operations, which are efficiently performed by the ASSP 
150 . 

The 2 0 -bit DSP instruction words have 4 -bit operand 
specifiers that can directly access data memory using 8 
address registers (r0-r7) within the register file 413 of 

10 the RISC control unit 302. The method of addressing by 
the 20 bit DSP instruction word is regular indirect with 
the address register specifying the pointer into memory, 
post -modification value, type of data accessed and 
permutation of the data needed to execute the algorithm 

15 efficiently. All of the DSP instructions control the 

multipliers 504A-504B, adders 510A-510C, compressor 506 
and the accumulator 512, the functional units of each 
signal processing unit 300A-300D. 

In the 40 bit instruction word, the type of 

20 extension from the 2 0 bit instruction word falls into 
five categories: 

1) Control and Specifier extensions that override the 
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control bits in mode registers 

2) Type extensions that override the type specifier in 
address registers 

3) Permute extensions that override the permute specifier 
5 for vector data in address registers 

4) Offset extensions that can replace or extend the 
offsets specified in the address registers 

5) DSP extensions that control the lower rows of 
functional units within a signal processing unit 300 to 

10 accelerate block processing. 



The 40-bit control instructions with the 20 bit 
extensions further allow a large immediate value (16 to 
20 bits) to be specified in the instruction and powerful 

15 bit manipulation instructions. 

Efficient DSP execution is provided with 2x20-bit 
DSP instructions with the first 2 0 -bits controlling the 
top functional units (adders 501A and 510B, multiplier 
504A, compressor 506) that interface to data buses from 

2 0 memory and the second 2 0 bits controlling the bottom 

functional units (adder 510C and multiplier 504B) that 
use internal or local data as operands . The top 
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functional units, also referred to as main units, reduce 
the inner loop cycles in the inner loop 602 by- 
parallelizing across consecutive taps or sections. The 
bottom functional units cut the outer loop cycles in the 
5 outer loop 601 in half by parallelizing block DSP 
algorithms across consecutive samples. 

Efficient DSP execution is also improved by the 
hardware architecture of the present invention. In this 
case, efficiency is improved in the manner that data is 

10 supplied to and from data memory 2 02 to feed the four 

signal processing units 300 and the DSP functional units 
therein. The data highway is comprised of two buses, X 
bus 531 and Y bus 53 3, for X and Y source operands, and 
one Z bus 532 for a result write. All buses, including X 

15 bus 531, Y bus 533, and Z bus 532, are preferably 64 bits 
wide. The buses are uni-directional to simplify the 
physical design and reduce transit times of data. In the 
preferred embodiment when in a 20 bit DSP mode, if the X 
and Y buses are both carrying operands read from memory 

20 for parallel execution in a signal processing unit 300, 
the parallel load field can only access registers within 
the register file 413 of the RISC control unit 302. 
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Additionally, the four signal processing units 300A-300D 
in parallel provide four parallel MAC units (multiplier 
504A, adder 510A, and accumulator 512) that can make 
simultaneous computations. This reduces the cycle count 
5 from 4 cycles ordinarily required to perform four MACs to 
only one cycle. 



DYADIC DSP INSTRUCTIONS 
All DSP instructions of the instruction set 

10 architecture of the ASSP 150 are dyadic DSP instructions 
within the 2 0 bit or 40 bit instruction word. A dyadic 
DSP instruction informs the ASSP in one instruction and 
one cycle to perform two operations. Referring now to 
Figure 6B is a chart illustrating the permutations of the 

15 dyadic DSP instructions. The dyadic DSP instruction 610 
includes a main DSP operation 611 (MAIN OP) and a sub DSP 
operation 612 (SUB OP) , a combination of two DSP 
instructions or operations in one dyadic instruction. 
Generally, the instruction set architecture of the 

20 present invention can be generalized to combining any 
pair of basic DSP operations to provide very powerful 
dyadic instruction combinations. Compound DSP 
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operational instructions can provide uniform acceleration 
for a wide variety of DSP algorithms not just multiply- 
accumulate intensive filters. The DSP instructions or 
operations in the preferred embodiment include a multiply 
instruction (MULT) , an addition instruction (ADD) , a 
minimize/maximize instruction (MIN/MAX) also referred to 
as an extreraa instruction, and a no operation instruction 
(NOP) each having an associated operation code 
("opcode"). Any two DSP instructions can be combined 
together to form a dyadic DSP instruction. The NOP 
instruction is used for the MAIN OP or SUB OP when a 
single DSP operation is desired to be executed by the 
dyadic DSP instruction. There are variations of the 
general DSP instructions such as vector and scalar 
operations of multiplication or addition, positive or 
negative multiplication, and positive or negative 
addition (i.e. subtraction). 

Referring now to Figure 6C and Figure 6D, bitmap 
syntax for an exemplary dyadic DSP instruction is 
illustrated. Figure 6C illustrates bitmap syntax for a 
control extended dyadic DSP instruction while Figure 6D 
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illustrates bitmap syntax for a non-extended dyadic DSP 
instruction. In the non- extended bitmap syntax the 
instruction word is the twenty most significant bits of a 
forty bit word while the extended bitmap syntax has an 
instruction word of forty bits. The three most 
significant bits (MSBs) , bits numbered 37 through 39, in 
each indicate the MAIN OP instruction type while the SUB 
OP is located near the middle or end of the instruction 
bits at bits numbered 20 through 22. In the preferred 
embodiment, the MAIN OP instruction codes are 000 for 
NOP, 101 for ADD, 110 for MIN/MAX, and 100 for MULT. The 
SUB OP code for the given DSP instruction varies 
according to what MAIN OP code is selected. In the case 
of MULT as the MAIN OP, the SUB OPs are 000 for NOP, 001 
or 010 for ADD, 100 or 011 for a negative ADD or 
subtraction, 101 or 110 for MIN, and 111 for MAX. In the 
preferred embodiment, the MAIN OP and the SUB OP are not 
the same DSP instruction although alterations to the 
hardware functional blocks could accommodate it. The 
lower twenty bits of the control extended dyadic DSP 
instruction, the extended bits, control the signal 
processing unit to perform rounding, limiting, absolute 
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value of inputs for SUB OP, or a global MIN/MAX operation 
with a register value. 

The bitmap syntax of the dyadic DSP instruction can 
be converted into text syntax for program coding. Using 
5 the multiplication or MULT non-extended instruction as an 
example, its text syntax for multiplication or MULT is 

(vmul | vmuln) . (vadd | vsub | vmax | sadd | ssub | smax) da, sx, 
sa, sy [, (psO) |psl) ] 

The "vmul | vmuln" field refers to either positive vector 
10 multiplication or negative vector multiplication being 
selected as the MAIN OP. The next field, 

"vadd | vsub | vmax | sadd | ssub | smax" , refers to either vector 
add, vector subtract, vector maximum, scalar add, scalar 
subtraction, or scalar maximum being selected as the SUB 

15 OP. The next field, "da", refers to selecting one of the 
registers within the accumulator for storage of results. 

The field "sx" refers to selecting a register within the 
RISC register file 413 which points to a memory location 
in memory as one of the sources of operands . The field 

20 "sa" refers to selecting the contents of a register 
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within the accumulator as one of the sources of operands. 
The field "sy" refers to selecting a register within the 
RISC register file 413 which points to a memory location 
in memory as another one of the sources of operands . The 
5 field of (psO) |psl)]" refers to pair selection of 
keyword PSO or PS1 specifying which are the source- 
destination pairs of a parallel-store control register. 
Referring now to Figure 6E and 6F, lists of the set of 
20-bit DSP and control instructions for the ISA of the 

10 present invention is illustrated. Figure 6G lists the 
set of extended control instructions for the ISA of the 
present invention. Figure 6H lists the set of 40-bit DSP 
instructions for the ISA of the present invention. 
Figure 61 lists the set of addressing instructions for 

15 the ISA of the present invention. 

Referring now to Figure 7, a block diagram 
illustrates the instruction decoding for configuring the 
blocks of the signal processing unit 300. The signal 
processor 300 includes the final decoders 704A through 

2 0 7 04N, and multiplexers 7 2 OA through 72 ON. The 

multiplexers 720A through 720N are representative of the 
multiplexers 514, 516, 52 0, and 522 in Figure 5B . The 
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predecoding 702 is provided by the RISC control unit 302 
and the pipe control 304. An instruction is provided to 
the predecoding 702 such as a dyadic DSP instruction 600. 

The predecoding 702 provides preliminary signals to the 
appropriate final decoders 7 04A through 7 0 4N on how the 
multiplexers 72 OA through 72 ON are to be selected for the 
given instruction. Referring back to Figure 5B, in a 
dyadic DSP instruction the MAIN OP generally, if not a 
NOP, is performed by the blocks of the multiplier Ml 
504A, compressor 506, adder Al 510A, and adder A2 510B. 
The result is stored in one of the registers within the 
accumulator register AR 512. In the dyadic DSP 
instruction the SUB OP generally, if not a NOP, is 
performed by the blocks of the adder A3 510C and the 
multiplier M2 504B. For example, if the dyadic DSP 
instruction is to perform is an ADD and MULT, then the 
ADD operation of the MAIN OP is performed by the adder Al 
510A and the SUB OP is performed by the multiplier Ml 
504A. The predecoding 72 0 and the final decoders 704A 
through 7 04N appropriately select the respective 
multiplexers 72 OA through 72 0B to select the MAIN OP to 
be performed by the adder Al 51 OA and the SUB OP to be 
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performed by the multiplier M2 504B. In the exemplary 
case, multiplexer 52 OA selects inputs from the data typer 
and aligner 5 02 in order for adder Al 51 OA to perform the 
ADD operation, multiplexer 522 selects the output from 
adder 510A for accumulation in the accumulator 512, and 
multiplexer 514B selects outputs from the accumulator 512 
as its inputs to perform the MULT SUB OP. The MAIN OP 
and SUB OP can be either executed sequentially (i.e. 
serial execution on parallel words) or in parallel (i.e. 
parallel execution on parallel words) . If implemented 
sequentially, the result of the MAIN OP may be an operand 
of the SUB OP. The final decoders 704A through 704N have 
their own control logic to properly time the sequence of 
multiplexer selection for each element of the signal 
processor 300 to match the pipeline execution of how the 
MAIN OP and SUB OP are executed, including sequential or 
parallel execution. The RISC control unit 3 02 and the 
pipe control 304 in conjunction with the final decoders 
704A through 704N pipelines instruction execution by 
pipelining the instruction itself and by providing 
pipelined control signals. This allows for the data path 
to be reconfigured by the software instructions each 

042390. P12534 49 
Express Mail: EL802874811US 



cycle . 



TELECOMMUNICATIONS PROCESSING 
Referring now to Figure 10, a detailed system block 
diagram of the packetized telecommunication communication 
network 100' is illustrated. In the packetized 
telecommunications network 100' an end system 108A is at 
a near end while an end system 108B is at a far end. The 
end systems 108A and/or 108B can be a telephone, a fax 
machine, a modem, wireless pager, wireless cellular 
telephone or other electronic device that operates over a 
telephone communication system. The end system 10 8A 
couples to switch 106A which couples into gateway 104A. 
The end system 108B couples to switch 106B which couples 
into gateway 104B. Gateway 104A and gateway 104B couple 
to the packet network 101 to communicate voice and other 
telecommunication data between each other using packets. 

Each of the gateways 104A and 104B include network 
interface cards (NIC) 130A-130N, a system controller 
board 1010, a framer card 1012, and an Ethernet interface 
card 1014. The network interface cards (NIC) 130A-130N 
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in the gateways provide telecommunication processing for 
multiple communication channels over the packet network 
101. On one side, the NICs 130 couple packet data into 
and out of the system controller board 1010. The packet 
data is packetized and depacketized by the system 
controller board 1010. The system controller board 1010 
couples the packets of packet data into and out of the 
Ethernet interface card 1014. The Ethernet interface 
card 1014 of the gateways transmits and receives the 
packets of telecommunication data over the packet network 
101. On an opposite side, the NICs 13 0 couple time 
division multiplexed (TDM) data into and out of the 
framer card 1012. The framer card 1012 frames the data 
from multiple switches 106 as time division multiplexed 
data for coupling into the network interface cards 130. 
The framer card 1012 pulls data out of the framed TDM 
data from the network interface cards 13 0 for coupling 
into the switches 106. 

Each of the network interface cards 13 0 includes a 
micro controller (cPCI controller) 140 and one or more of 
integrated telecommunications processors 150A-150N. Each 
of the integrated telecommunications processors 150N 
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includes one or more RISC/DSP core processor 2 00, one or 
more data memory (DRAM) 202, one or more program memory 
(PRAM) 204, one or more serial TDM interface ports 206 to 
support multiple TDM channels, a bus controller or memory 
movement engine 2 08, a global or buffer memory 210, a 
host or host bus interface 214, and a microcontroller 
(MIPS) 223. Firmware flexibly controls the functionality 
of the blocks in the integrated telecommunications 
processor 150 which can vary for each individual channel 
of communication. 

Referring now to Figure 11, a block diagram of the 
firmware telecommunications processing modules of the 
application specific signal processor 150, forming the 
"integrated telecommunications processor" 150, for one of 
multiple full duplex channels is illustrated. One full 
duplex channel consists of two time-division multiplexed 
(TDM) time slots on the TDM or near side and two packet 
data channels on the packet network or far side, one for 
each direction of communication. The telecommunication 
processing provided by the firmware can provide telephony 
processing for each given channel including one or more 
of network echo cancellation 1103, dial tone detection 
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1104, voice activity detection 1105, dual-tone multi- 
frequency (DTMF) signal detection 1106; dual-tone multi- 
frequency (DTMF) signal generation 1107; dial tone 
generation 1108; G.7xxx voice encoding (i.e. compression) 
1109; G.7xxx voice decoding (i.e. decompression) 1110, 
and comfort noise generation (CNG) 1111. The firmware 
for each channel is flexible and can also provide GSM 
decoding/encoding, CDMA decoding/encoding, digital 
subscriber line (DSL) , modem services including 
modulation/ demodulation, fax services including 
modulation/demodulation and/or other functions associated 
with telecommunications services for one or more 
communication channels. While ja-Law / A- Law decoding 
1101 and )a-Law / A- Law encoding 1102 can be performed 
using firmware, in one embodiment it is implemented in 
hardware circuitry in order to speed the encoding and 
decoding of multiple communication channels. The 
integrated telecommunications processor 150 couples to 
the host processor 140 and a packet processor 1120. The 
host processor 14 0 loads the firmware into the integrated 
telecommunications processor to perform the processing 
in a voice over packet (VoP) network system or packetized 
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network system. 

The ja,-Law / A-Law decoding 1101 decodes encoded 
speech into linear speech data. The |a.-Law / A-Law 
encoding 1102 encodes linear speech data into (x-Law / A- 
5 Law encoded speech. The integrated telecommunications 
processor 150 includes hardware G.711 |j,-Law / A-Law 
decoders and p.-Law / A-Law encoders. The hardware 
conversion of A- law/ jj- law encoded signals into linear PCM 
samples and vice versa is optional depending upon the 

10 type of signals received. Using hardware for this 

conversion is preferable in order to speed the conversion 
process and handle additional communication channels. The 
TDM signals at the near end are encoded speech signals. 
The integrated telecommunications processor 150 receives 

15 TDM signals from the near end and decodes them into 

pulse- code modulated (PCM) linear data samples Si n . These 
PCM linear data samples Si n are coupled into the network 
echo- cancellation module 1103 . The network echo- 
cancellation module 1103 removes an echo estimated signal 

20 from the PCM linear data samples Si n to generate PCM 

linear data samples S ou t - The PCM linear data samples S ou t 
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are provided to the DTMF detection module 1106 and the 
voice-activity detection and comfort -noise generator 
module 1105. The output of the Network Echo Canceller 
(Sout) is coupled into the Tone Detection module 1104, 
5 the DTMF Detection module 1106, and the Voice Activity 
Detection module 1105. Control signals from the Tone 
Detection module 1104 are coupled back into the Network 
Echo Cancellation module 1103. The decoded speech 
samples from the far end are PCM linear data samples Rin 

10 and are coupled into the network echo cancellation module 
1103 . The network echo cancellation module 1103 copies 
Rin for echo cancellation purposes and passes it out as 
PCM linear data samples R ou t • The PCM linear data samples 
R out are coupled into the mu-law and A-law encoding module 

15 1102. The PCM linear data samples R out are encoded into 
mu-law and A-law encoded speech and interleaved into the 
TDM output signals of the TDM channel Output to the near 
end. The interleaving for framing of the data is 
performed after the linear to A-law/mu-law conversion by 

2 0 a Framer (not shown in Figure 11) which puts the 

individual channel data into different time slots. For 
example, for Tl signaling there are 24 such time slots 
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for each Tl frame. 

The Network Echo Cancellation module 1103 has two 
inputs and two outputs because it has full duplex 
interfaces with both the TDM channels and the packet 
5 network via the VX-Bus. The network echo cancellation 
module 1103 cancels echoes from linear as well as non- 
linear sources in the communication channel. The network 
echo cancellation module 1103 is specifically tailored to 
cancel non-linear echoes associated with the packet 

10 delays/ latency generated in the packetized network. 

The tone detection module 1104 receives both tone 
and voice signals from the network cancellation module 
1103 . The tone detection module 1104 discriminates the 
tones from the voice signals in order to determine what 

15 the tones are signaling. The tone detection module 

determines whether or not the tones from the near end are 
call progress tones (dial tone, busy tone, fast busy 
tone, etc.) signaling on-hook, ringing, off-hook or busy, 
or a fax/modem call. If a far end is dialing the near 

20 end, the call progress tones of on-hook, ringing, or off- 
hook or busy signal is translated into packet signals by 
the tone detection module for transmission over the 
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packet network to the far end. If the tone detection 
module determines that fax/modem tones are present 
indicating that the near end is initiating a fax/modem 
call, further voice processing is bypassed and the echo 
5 cancellation by the network echo cancellation module 1103 
is disabled. 

To detect tones, the tone detection module 1104 uses 
infinite impulse-response (IIR) filters and accompanying 
logic. When a FAX or modem tone signaling tone is 

10 detected, the signaling tones help control the respective 
signaling event. The tone detection module 1104 detects 
the presence of several in-band tones at specific 
frequencies, checks their cadences, signals their 
presence to the echo cancellation module 1103, and 

15 prompts other modules to take appropriate actions. The 
tone detection module 1104 and the DTMF detection module 
operate in parallel with the network echo canceller 1103 . 

The tone detection module can detect true tones with 
2 0 signal amplitude levels from 0 dB to -4 0 dB in the 
presence of a reasonable amount of noise. The tone 
detection module can detect tones within a reasonable 
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neighborhood of center frequency with detection delays 
within a prescribed limit. The tone detection module 
matches the tone cadences, as required by the tone- 
cadence rules defined by the ITU/TIA standards. To 
achieve the above properties, certain trade-offs are 
necessary in that the tone detection module must adjust 
several energy thresholds, the filter roll -off rate, and 
the filter stopband attenuation. Furthermore, the tone 
detection module is easily upgradeable to allow detection 
of additional tones simply by updating the firmware. The 
current telephony-related tones that the tone-detection 
module 1104 can detect are listed in the following table: 

• Tones the Tone-Detection Module Detects 



Tone Name 


Tone Description 


'On' Time 


'Off' Time 


FAXCED 


2100 Hz 


2.6 to 4 seconds 




Echo Cancellation 
Disable / Modem 
Tones 


2100 Hz, with phase reversal every 450 
ms 


2.6 to 4 seconds 




FAXCNG 


1100 Hz 


0.5 seconds 


3 seconds 


FAX V.21 


7E flags frequency-shift keying at 1750-Hz 
carrier. 


At least three 7E flags signal the onset of a FAX 
siqnal being sent. 


2400 Hz 


In-band signaling tones and continuity 
check tones 


G.168 Test 8 describes the performance of echo 
cancellation in the presence of these tones. 


2600 Hz 



When a 2100-Hz tone with phase reversal is detected 
indicating a V- series modem operation the echo canceller 
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is shut off temporarily. When the tone detection module 
detects facsimile tones, the echo canceller is shut off 
temporarily. The tone detection module can also detect 
the presence of narrowband signals, which can be control 
signals to control the actions of the echo cancellation 
module 1103. The tone detection modules function both 
during call set up and while the call progress through 
termination of the communication channel for the call. 
Any tone which is sent, generated, or detected before the 
actual call or communication channel is established, is 
referred to as an out-of-band tone. Tones which are 
detected during a call, after the call has been set-up, 
are referred to as in-band tones. The Tone Detector, in 
it's most general form, is capable of detecting many 
signaling tones. The tones that are detected include the 
call progress tones such as a Ringing Tone, a Busy Tone, 
a Fast Busy Tone, a Caller ID Tone, a Dial Tone, and 
other signaling tones which vary from country to country. 
The, call progress tones control the handshaking required 
to set up a call. Once a call is established, all the 
tones which are generated and detected are referred to as 
in-band tones . The same Tone Detectors and Generators 
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Blocks are used both for in-band and out -of band tone 
detection and generation. 

In most conversations, speakers only voice speech 
about 35% of the time. During the remaining 65% of the 
5 time in most conversations, a speaker is relatively 
silent due to natural pauses for emphasis, clarity, 
breathing, thought processes, and so forth. When there 
are more than two speakers, as in conference calls, there 
is even more periods of silence. It is an inefficient 

10 use of a communication channel to transmit silence from 
one end to another. Thus, statistical multiplexing 
techniques are used to allocate to other calls this 65% 
of 'quiet' time (also known as 'dead time' or 'silence') . 
Even though quiet time is allocated to other calls, the 

15 channel quality during the time that end users use the 
communication channel is preserved. 

However, silence at one end, which is not 
transmitted to an opposite end, needs to be simulated and 
inserted into the call at the opposite end. 

2 0 Sometimes when we speak over a telephone, we hear 

the echo of our own speech which we usually ignore. The 
important point is that we do hear the echo. However, 
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many digital telephone connections are so noise- free 
there is no background noise or residual echo at all. As 
a result a far-end user, hearing absolute silence, may 
think the connection is broken and hang up. 
5 To convince users there is a connection, the 

background or Comfort -Noise Generation (CNG) module 1105 
simulates silence or quite time at an end by adding 
background noise such as a comforting 'hiss' . The CNG 
module 1105 can simulate ambient background noise of 

10 varying levels. An echo-cancellation setup message can 
be used to control the CNG module as an external 
parameter. The comfort noise generation module 
alleviates the effects of switching in and out as heard 
by far-end talkers when they stop talking. The near-end 

15 noise level is used to determine an appropriate level of 
background noise to be simulated and inserted at the S 0u t 
(Send Out) Port. However before silence can be simulated 
by the CNG module 1105, it first must be detected. 

The Voice-Activity Detection (VAD) module 1105 is 

2 0 used to detect the presence or absence of silence in a 
speech segment. When the VAD module 1105 detects 
silence, background noise energy is estimated and an 

042390 .P12534 61 
Express Mail: EL802874811US 



encoder therein generates a Silence- Insert ion Description 
(SID) frame. The SID frame is transmitted to an opposite 
end to indicate that silence is to be simulated at the 
estimated background noise energy level . In response to 
receiving an SID frame at the opposite end (i.e., the Far 
End) , the CNG module 1111 generates a corresponding 
comfort noise or simulated silence for a period of time. 
Using the received level of the ambient background noise 
from the SID frame, the CNG produces a level of comfort 
noise (also called 'white noise' or v pink noise' or 
simulated silence) that replaces the typical background 
noises that have been removed, thereby assuring the far- 
end person that the connection has not been broken. The 
VAD module 1105 determines when the comfort noise is to 
be turned on (i.e. a quiet period is detected) and when 
comfort noise is to be turned off (i.e. the end user is 
talking again) . The VAD 1105 (in the Send Path) and CNG 
module 1111 (in the Receive Path) work effectively 
together at two different ends so that speech is not 
clipped during the quiet period and comfort noise is 
appropriately generated. 

The VAD module 1105 includes an Adaptive Level 
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Controller (ALC) that ensures a constant output level for 
varying levels of near-end inputs. The adaptive level 
controller includes a variable gain amplifier to maintain 
the constant output level. The adaptive level controller 
includes a near-end energy detector to detect noise in 
the near-end signal. When the near end energy detector 
detects noise in the near-end signal the ALC is disabled 
so that undesirable noise is not amplified. 

The DTMF detection module 1106 performs dual -tone 
multiple frequency detection necessary to detect DTMF 
tones as telephone signals. The DTMF detection module 
receives signals on Sout from the echo cancellation 
module 1103. The DTMF detection module 1106 is always 
active, even during normal conversation in case DTMF 
signals are transmitted during a conversation. The DTMF 
detection module does not disable echo cancellation when 
DTMF tones are detected. The DTMF detection module 
includes narrow-band filters to detect special tones and 
DTMF dialing tones. Furthermore because the G.7xxx 
speech encoding module 1109 and decoding module 1110 are 
used to compress/decompress speech signals and are not 
used for control signaling or dialing tones, the DTMF 
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detection module may be used as appropriate to control 
sequencing, loading, and the execution of CODEC firmware. 

The DTMF detection module 1106 detects the DTMF 
tones and includes a decoder to decode the tones to 
determine which telephone keypad button was pressed. The 
DTMF detection module 1106 is based on a Goertzel 
algorithm and meets all conditions of the Bellcore DTMF 
decoder tests as well as Mitel decoder tests. 

The DTMF detection module 1106 indicates which 
dialpad key a sender has pressed after processing a few 
frames of data. The DTMF detection module can be adapted 
to receive user-defined parameters. The user defined 
parameters can be varied to optimize the DTMF detector 
for specific receiving conditions such as the thresholds 
for both of the frequencies made up by the 'rows' and 
'columns' of the DTMF keypad, thresholds for acceptable 
twist ratios (the ratio of powers between the higher and 
lower frequencies), silence level, signal-to-noise 
ratios, and harmonic ratios. 

The DTMF generation module 110 7 provides dual -tone 
multiple frequency (DTMF) generation necessary to 
generate DTMF tones for telephone signals. The encoding 
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process in the DTMF generation module 1107 generates one 
of the various pairs of DTMF tones. The DTMF generation 
module 1107 generates digitized dual-tone multi-frequency 
samples for a dialpad key depression at the far end. The 
DTMF generation module 1107 is also always active, even 
during normal conversation. The DTMF generation module 
1107 includes narrow-band filters to generate special 
tones and DTMF dialing tones. The DTMF generation module 
1107 receives a DTMF packet from the far end over the 
packet network. The DTMF generation module 1107 includes 
a DTMF decoder to decode the DTMF packet and properly 
generate tones. The DTMF packet payload includes such 
information as the key or digit that was pressed that is 
to be played (i.e. dialpad key coordinates), duration to 
be played (Number of successive 125 microsecond samples 
during which the tone is enabled and Number of successive 
12 5 microsecond samples during which the tone is shut off 
disabled) , amplitude level (Lower- frequency amplitude 
level in dB and Upper- frequency amplitude level in dB) 
and other information. By specifying these parameters, 
the DTMF generation module 1107 can generate DTMF 
signaling tones having the required signal amplitude 
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levels and timing for the appropriate digit/tone. The 
DTMF tones generated by the DTMF generation module 1107 
are coupled into the echo canceller on Ri n . 

The tone generation module 1108 operates similar to 
the DTMF generation module 1107 but generates the 
specific tones that provide telephony signals. The tones 
generated by the tone generation module include tones to 
signal On-hook/of f -hook, Ringing, Busy, and special tones 
to signal FAX/modem calls. A tone packet is received 
from the far end over the packet network and is decoded 
and the parameters of the tone are determined. The tone 
generation module 1108 generates tone similar to the DTMF 
generation module 1107 previously described using 
narrowband filters. 

The G.7xx encoding module 1109 provides speech 
compression before being packetized. The G.7xx encoding 
module 1109 receives speech in a linear, 64-Kbps pulse- 
code modulation (PCM) format from the network echo 
cancellation module 1103. The speech is compressed by 
the G.7xx encoding module 1109 using one of the 
compression standards specified for low bit-rate voice 
(LBRV) CODECS, including the ITU-T internationally 
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standardized G . 7xx series. Many speech CODECS can be 
chosen. However, the selected speech CODEC determines the 
block size of speech samples and the algorithmic delay. 
Of several industry- standard speech CODECS in use, each 
implements a different combination of Coding rate, Frame 
length (the size of the speech sample block) , and 
Algorithmic delay (or detection delay) caused by how long 
it takes all samples to be gathered for processing. 

The G . 7 xx decoding module 1110 provides speech 
decompression of signals received from the far end over 
the packet network. The decompressed speech is coupled 
into the network echo cancellation module 1103. The 
decompression algorithm of the G.7xx decoding module 1110 
needs to match the compression algorithm of the G.7xx 
encoding module 110 9. The G.7xx decoding module 1110 and 
the G . 7xx encoding module 1109 are referred to as a CODEC 
(coder-decoder) . Currently, there are several industry- 
standard speech CODECS from which to pick. The 
parameters for selection of a CODEC are previously 
described. The ITU CODECS include G.711, G.722, G. 723.1, 
G.726, G.727, G.728, G.729, G.729A, and G.728E. Each of 
these can easily be selected by choice of firmware. 
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Data enters and leaves the processor 150 through the 
TDM serial I/O ports and a 32-bit parallel VX-Bus 1112. 
Data processing in the processor 150 is performed using 
16-bits of precision. The companded 8-bit PCM data on 
5 the TDM channel input is converted into 16 -bit linear PCM 
for processing in the processor 150 and is re-converted 
back into 8 -bit PCM for outputting on the TDM channel 
output . 

Referring now to Figure 12, a flow chart diagram of 
10 the telephony processing of linear data (Si n ) from a near 
end to packet data on the network side at a far end is 
illustrated. Near in data Si n is provided to the 
integrated telecommunications processor 150. At step 
12 01, a determination is made whether the echo 
15 cancellation module 1103 is enabled or not. If the echo 
cancellation module 1103 is not enabled, the integrated 
telecommunications processor 150 jumps to the tone 
detection module 12 05 which detects the presence or 
absence of in-band tones in the Sin signal. If the echo 
20 cancellation module 1103 is enabled at step 1201, the 
near in data Si n is coupled into the echo cancellation 
module 10 03 at step 12 0 3 and data from the far end 
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(Farln) is utilized to cancel out echoes. After echo 
cancellation is performed at step 12 03 and/or if the echo 
cancellation module 1103 is enabled, the integrated 
telecommunications processor 150 jumps to the tone 
5 detection step 1205 where the data is coupled into tone 
detection module 1104. The processor 150 goes to step 
1207 . 

At step 12 07, a determination is made whether a fax 
tone is present. If the fax tone is present at step 
10 1207, the integrated telecommunications processor 150 

jumps to step 1209 to provide fax processing. If no fax 
tone is present at step 1207, further interpretation of 
the result by the tone detection module occurs at step 
1211 . 

15 At step 1211, a determination is made whether there 

is an echo cancellation control tone to indicate the 
Enabling and Disabling of the Echo Canceller. If an Echo 
cancellation control tone is present, integrated 
telecommunications processor jumps to step 1215. If no 

20 echo cancellation control tone is detected at step 1211, 
the incoming data signal Sin may be a voice or speech 
signal and the integrated telecommunications processor 
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jumps to the VAD module at step 1219. 

At step 1215 the energy of the Tone is compared to a 
predetermined threshold. A determination is made whether 
or not the energy level in the signal S in is less than a 
5 threshold level. If the energy of the Tone on S in is 
greater than or equal to this predetermined threshold, 
the processor jumps to step 1213. If the energy of the 
Tone on Si n is less than the threshold level, the 
integrated telecommunications processor 15 0 jumps to step 
10 1217. 

At step 1213, the echo cancellation disable tone has 
been detected and the energy of the tone is greater than 
a given predetermined threshold which causes the echo 
cancellation module to be disabled to cancel newly 
15 arriving Sin signals. After the Echo Canceller Disable 

Tone has been detected, the Echo Canceller block is given 
an indication through a control signal to disable Echo 
Cancellation . 

At step 1217, the echo cancellation disable tone was 
20 not detected and the energy of the tone is less than the 
given predetermined threshold. The echo cancellation 
module is enabled or remains enabled if already in such 
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state. The Echo Canceller block is given an indication 
through a control signal to enable Echo Cancellation. 
This may indicate the end of Echo Canceller Disable Tone. 
The predetermined threshold level is a cutoff level 
5 to determine whether or not an Echo Canceller Disable 
Flag should be turned OFF. If the Tone Energy drops 
below a predetermined threshold, the Echo Cancellation 
disable flag is turned OFF. This flag is coupled into 
the Echo Canceller module. The Echo Canceller module is 

10 enabled or disabled in response to the echo cancellation 
disable flag. If the Tone energy is greater than the pre- 
determined threshold, then the processor jumps to step 
1213 as described above. In either case, whether or not 
the echo cancellation disable flag is set true or false 

15 or at steps 1213 or 1217, the next step in processing is 
the VAD module at step 1219. 

At step 1219, the data signal Sin is coupled into 
the voice activity detector module 1105 which is used to 
detect periods of voice/DTMF/ tone signals and periods of 

2 0 silence that may be present in the data signal Sin. The 
processor 150 jumps to step 1221. 

At step 1221, a determination is made whether 
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silence had been detected. If silence has been detected, 
the integrated telecommunications processor 150 jumps to 
step 1223 where an SID packet is prepared for 
transmission out as a packet on the packet network at the 
5 far end. If no silence is detected at step 1221, the 
processor couples the signal Sin into the ambient level 
control (ALC) module (not shown in FIG. 11) . At step 
1225, the ALC amplifies or de-amplifies the signal S in to 
a constant level. Integrated telecommunications 

10 processor 150 then jumps to step 1227 where 

DTMF/Generalized Tone detection is performed by the 
DTMF/Generalized Tone detection module 1106. The 
processor goes to step 1229. 

At step 122 9 a determination is made whether DTMF or 

15 tone signals have been detected. If DTMF or tone signals 
have been detected, integrated telecommunications 
processor 150 generates DTMF or tone packets at step 1231 
for transmission out the packet network at the far end. 
If no DTMF or tone signals are detected at step 122 9, the 

20 signal N is a voice/speech signal and the G . 7XX encoding 
module 110 9 encodes the speech into a speech packet at 
step 1233. A speech packet 1235 is then transmitted out 
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the packet network side to the far end. 

Referring now to figure 13, a flow chart diagram of 
the telephony processing of packet data from the network 
side at the far end by the integrated telecommunications 
processor 150 into R out signals at the near end is 
illustrated. The integrated telecommunications processor 
150 receives packet data from the far end over the packet 
network 101. At step 13 01, a determination is made as to 
what type of packet has been received. The integrated 
telecommunications processor 150 is expecting one of five 
types of packets. The five packet types that are 
expected are a fax packet 13 03, a DTMF packet 13 04, a 
Tone packet 13 05, or a speech or SID packet 13 06. 

If at step 1301 a determination has been made that a 
fax packet 13 03 has been received, data from the packet 
is coupled into a fax demodulation module by the 
integrated telecommunications processor at step 1308. At 
step 13 08, the fax demodulation module demodulates the 
data from the packet using fax demodulation into Rout 
signals at the near end. If at step 1301 a determination 
has been made that a DTMF packet 13 04 has been received, 
the data from the packet is coupled into the DTMF 
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generation module 1107 at step 1310. At step 1310, the 
DTMF generation module 1107 generates DTMF tones from the 
data in the packet Rout signals at the near end. If at 
step 13 01 the packet received is determined to be a tone 
packet 13 05, the data from the packet is coupled into the 
tone generation module 1108 at step 1312. At step 1312, 
the tone generation module 1108 generates tones as Rout 
signals at the near end. If at step 13 01 a determination 
has been made that speech or SID packets 13 0 6 have been 
received, the data from the packet is coupled into the 
G.7xx decoding module 1110 at step 1314. At step 1314, 
the G . 7xx decoding module 1110 decompresses the speech or 
SID data from the packet into Rout signals at the near 
end. 

If at step 13 01 a determination has been made that 
the packet is either a DTMF packet 13 04, a tone packet 
13 05, a speech packet or an SID packet 1306, the 
integrated telecommunications processor 150 jumps to step 
1318. If at step 1318, the echo canceller flag is 
enabled, the R out signals from the respective module is 
coupled into the echo cancellation module. These R out 
signals are the Far End Input to the Echo Canceller whose 
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echo, if not cancelled, rides on the Near End Signal when 
it gets transmitted to the other end. At step 1318, the 
respective R out signal from a module in conjunction with 
the S in signal and the Echo Canceller Enable Flag from the 
5 nearend is used to perform echo canceling. The Echo 

Canceller Enable Flag is a binary flag which turns ON and 
OFF the Echo Canceling operation in step 1318. When this 
flag is ON, the NearEndln signals are processed to cancel 
the potential echo of the FarEnd. When this flag is OFF , 

10 the NearEndln signal by-passes the Echo Canceling as is. 
Referring now to Figure 14A, a block diagram of the 
data flows and interaction between exemplary functional 
blocks of the integrated telecommunications processor 150 
for telephony processing is illustrated. There are two 

15 data flows in the voice over packet (VOP) system provided 
by the integrated telecommunications processor 150. The 
two data flows are TDM- to- Packet and Packet -to- TDM which 
are both executed in tandem to form a full duplex 
system . 

2 0 The functional blocks in the TDM- to-Packet data flow 

includes the Echo Canceller 1403, the tone detector 1404, 
the voice activity detector (VAD) 14 05, the automatic 
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level controller (ALC) 1401, DTMF detector 1405, and 
packetizer 1409. The Echo Canceller 1403 substantially 
removes a potential echo signal from the near end of 
gateway. The Tone Detector 1404 controls the echo 
5 canceller and other modules of the integrated 

telecommunications processor 150. The tone detector is 
for detecting the EC Disable Tone, the FAXCED tone, the 
FAXCNG tone and V21 'IE' flags. The tone detector 1404 
can also be programmed to detect a given number of 

10 signaling tones also. The VAD 1405 generates Silence 

Information Descriptor (SID) when speech is absent in the 
signal from the near end. The ALC 1401 optimizes volume 
(amplitude) of speech. The DTMF detector 14 0 5 looks for 
tones representing DTMF digits. The Packetizer 1409 

15 packetizes the appropriate payloads in order to send 
packets . 

The functional blocks in the Packet to TDM Flow 
include: the Depacketizer 1410, the Comfort Noise 
Generator (CNG) 142 0, the DTMF Generator 14 07, the PCM to 
20 linear converter 1421, and the optional Narrowband signal 
detector 1422. The Decoder 1410 depackets the packet 
type and routes it appropriately to the CNG 142 0, the PCM 
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to linear converter 1421 or the DTMF generator 1407. The 
CNG 142 0 generates comfort noise based on an SID packet. 

The DTMF generator 1407 generates DTMF signals of a 
given amplitude and duration. The optional Narrowband 
5 signal detector 1422 detects when it is undesirable for 
the echo canceller to cancel the echo of certain tones on 
Rin side. The PCM to Linear converter 1421 converts A- 
law/mu-law encoded speech into 16-bit linear PCM samples. 
However, this block can easily be replaced by a general 

10 speech decoder (e.g. G.7xx speech decoder) for a given 
communications channel by swapping out the appropriate 
firmware code . The TDM IN/OUT block 1424 is a A-law/mu- 
law to linear conversion block (i.e. 1102, 1103) which 
occurs at the TDM interface. This could be performed by 

15 hardware or can be programmed and performed by firmware. 

The integrated telecommunications processor is a 
modular system. It is easy to open new communication 
channels and support numerous channels simultaneously as 
a result. These functional modules or blocks of the 

20 integrated telecommunications processor 150 interact with 
each other to achieve complete functionality. 

Communication between blocks or modules, that is 
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inter functional -block communication, is carried out by 
using shared memory resources with certain access rules. 

The location of the shared area in memory is called 
Inter functional -block data (InterFB data) . All 
functional blocks of the integrated telecommunications 
processor 150 have permission to read this shared area in 
memory but only a few blocks or modules of the integrated 
telecommunications processor 150 have permission to write 
into this shared area of memory. The InterFB data is a 
fixed (reserved) area in memory starting at a memory 
address such as 0x0050H for example. All the functional 
blocks or modules of the integrated telecommunications 
processor 150 communicate with each other if need using 
this shared memory or InterFB data. The same shared 
memory area may be used for both TDM- Packet and Packet - 
TDM data flows or they may be split into different shared 
memory areas . 

The table below indicates a sample set of parameters 
that may be communicated between functional blocks in the 
integrated telecommunications processor 150. The column 
"Parameter Name" indicates the parameter while the 
"Function" column indicates the function the parameters 
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assist in performing. The "Write/Read Access" column 
indicates what functional blocks can read or write the 
parameter . 



Parameter Name 


Write/Read Access 


Function 


td initialize 


Script (w) , 

tone detect (w/r) 


Initializes state 
for TD 


Ecdisable_detect , 
f axced_detect , 
f axcng_detect , 
faxv21 detect, 


Td (w) , ec (r , w) 


Switching ALC, EC 
ON/OFF 


Key, dtmf_detect 


Dtmf (w) , 
packetizer (r) 


Indicates dtmf 
digit presence 


Vad_decision, 
noise level 


Vad (w) , cng (r) , 
script/ale (r) 


Voice decision, 
SID for CNG 


Tone_f lag, 
f requencyl , 
f requency2 


Narrowband (w) , 
ec/ script (r) 


Indicates 
narrowband signal 
on Rin 



5 



The interaction between the functional blocks or 
modules and the respective signals are now described. 
The echo canceller 1403 receives both the Sin signal and 

10 Rin signal in order to generate the Sout signal as the 
echo cancelled signal. The echo canceller 1403 also 
generates the Rout signal which is normally the same as 
Rin. That is, no further processing is performed to the 
Rin signal in order to generate the Rout signal in most 

15 cases. The echo canceller 1403 operates over both data 
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flows in that it receives from the TDM end as well as 
data from the packet side. The echo canceller 14 03 
properly functions only when data is fully available in 
both the flows. When a TDM frame (Sin) is ready to be 
processed, a packet is grabbed from the packet buffer and 
decoded (Rin) and put into memory. The TDM frame is the 
Sin signal data from which the echo needs to be removed. 
The decoded packet is the Rin data signal . 

The tone detector 14 04 receives the output Sout from 
the echo canceller 1403. The tone detector 1404 looks 
for the EC Disable Tone, the FAXCED tone, the FAXCNG tone 
and the tones representing V21 A 7E' flags. The tone 
detector functions on Sout data after the echo canceller 
1403 has completed its data processing. The tone 
detector's main purpose is to control other modules of 
the integrated telecommunications processor 150 by 
turning them ON or OFF. The tone detector 14 04 is 
basically a switching mechanism for the modules such as 
the Echo Canceller 1403 and the ALC 1401. The tone 
detector can write the ecdisable flag in the shared 
memory while the echo canceller 1402 reads it. The tone 
detector or Echo Canceller writes an ALCdisable flag in 
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the shared memory while the ALC 1401 reads it. Most 
events detected by the tone detector are used by the echo 
canceller in one way or another. For example, the Echo 
Canceller 1403 is to turn OFF when an ecdisable tone is 
detected by the tone detector 1404. Modems usually send 
the /ANS signal (or ecdisable tone) to disable the echo 
cancellers in a network. When the tone detector 1404 of 
the integrated telecommunications processor 150 detects 
the ecdisable tone, it writes a TRUE state into the 
memory location representing ecdisable flag. On the next 
TDM data packet flow, the echo canceller 1403 reads the 
ecdisable flag to determine it is to perform echo 
cancellation or not. In the case its disabled, the echo 
canceller 1403 generates Sout as Sin with no echo 
canceling signal added. The ecdisable flag is updated to 
a FALSE state by the echo canceller 14 03 when the root 
mean squared energy of Sin (RMS) falls below -36dbm 
indicating no tone signals. 

In certain cases it is undesirable for the ALC 1401 
to modify the amplitude of a signal such as when sending 
FAX data. In this case it is desirable for the ALC 1041 
to be turned ON and OFF. In most cases an ANS tone is 

042390. P12534 81 
Express Mail: EL802874811US 



required to turn the ALC 14 01 OFF. When the tone 
detector 14 04 detects an ANS tone, it writes a TRUE state 
into the memory location for the ALC disable flag. The 
ALC 14 01 reads the shared memory location for the ALC 
disable flag and turns itself ON or OFF in response to 
its state. Another condition that ALC disable flag may be 
turned ON could be a signal from the Echo Canceller 
saying there was no detected Near End signal. This may be 
the case when the Sout signal is below a given threshold 
level . 

When the tone detector detects an EC disable tone, 
it turns OFF the echo canceller 1403 (G.168). When the 
tone detector detects a FAXCED tone (ANS), it turns OFF 
the ALC 14 01 (G.169) and provides a data by-pass for FAX 
processing. When the tone detector detects a FAXCNG tone, 
it provides a data by pass for FAX processing. When the 
tone detector simultaneously detects three V21 l 7E' Flags 
in a row, it provides a data by pass for FAX processing. 

The VAD 1405 is used to reduce the effective bitrate 
and optimize the bandwidth utilization. The VAD 14 05 is 
used to detect silence from speech. The VAD encodes 
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periods of silence by using a Silence Information 
Descriptor rather than sending PCM samples that represent 
silence. In order to do so, the VAD functions over 
frames of data samples of Sout . The frame size can vary 
depending on situations and needs of different 
implementations with a typical frame representing 80 data 
samples of Sout. If the VAD 1405 detects silence, it 
writes a voice_activity flag in the shared memory to 
indicate silence. It also measures the noise power level 
and writes a valid noise_power level into a shared memory 
location. 

Referring now to Figure 14B, a flow chart of an 
algorithm for performing voice activity detection 1449 
utilizing the voice activity detection module/processor 
1405 is illustrated. Samples of Input Speech Signal x [n] 
are inputted into a framer 1450. The framer 1450 
typically produces frames that have a length of 4 0 
samples (5 milliseconds) or 80 samples (10 milliseconds) 
of pulse code modulated speech. After a frame of data 
has been formed by the framer 1450, the frame is analyzed 
by five different processes. These five processes 
include fast Fourier transform (FFT) processing 1451, 
zero crossing detection 1452, noise detection 1453, 
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energy discrimination 14 54, and instantaneous energy 
discrimination 1455. The processes operate on a frame by 
frame basis. 

These processes 1451-1455 each set or clear a flag 
5 for the respective process (i.e. there are 5 flags) that 
are used in order to make a intermediate voice activity 
detection decision at step 1460. Further, the 
intermediate voice activity detection decision 1460 can 
then weigh the processing steps 1451-1455 in a number of 

10 ways. The fast Fourier transform (FFT) process 1451 can 
set or clear the fast Fourier transform flag. The zero 
crossing detection process 1452 can set or clear the zero 
crossing flag. The noise detection process 1453 can set 
or clear the noise flag. The energy discrimination 

15 process 1454 can set or clear the energy flag. The 

instantaneous energy discrimination process 1455 can set 
or clear the instantaneous energy flag. 

In one embodiment, if the energy flag is set or the 
instant energy flag is set and the noise flag is cleared 

20 and the zero crossing flag is cleared, then the 

intermediate voice activity detection decision 1460 is 
set to indicate that voice has been detected. In general 
this decision could also have a weighting of a previous 
frame, or previous frames on the different flags. 
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Otherwise, the intermediate voice activity detection 
decision 1460 is cleared indicating that no voice was 
detected such that silence is present. 

After completing the intermediate voice activity 
5 detection decision 1460, the voice activity detection 
algorithm 1449 jumps to a HangOver and Speech Kick In 
process 1461. At step 1461, HangOver processing and 
Speech Kick In processing is performed and the voice 
activity detection flag is either set or cleared in 

10 response thereto. The HangOver processing 1461 looks 

back over prior frames to determine if a series of past 
frames have the voice activity detection flags set or 
cleared. If the Voice Activity Detection in the past 
frame is set, then a HangOver counter is set to a given 

15 number (e.g. 4 or 5) . If the past frame has the Voice 
Activity Detection Flag as zero (cleared) , then the 
Hangover counter is decremented by 1. The Voice Activity 
Detection (VAD) Flag is not set to zero unless the 
HangOver Counter is Zero and the current Interim VAD 

20 Decision says that this frame is not a voice frame. This 
HangOver Processing ensures a smooth transition from 
speech to silence. 

In the same manner, the Speech Kick In looks for a 
set number of consecutive frames (e.g. three consecutive 
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frames) where the Interim VAD flag has been declared to 
be 1 (going from silence to speech) before setting the 
Voice Activity Detection (VAD) Flag to 1. This ensures 
that a spurious declaration of speech is not made while 
5 transitioning from silence to speech. 

At step 1462, a determination is made whether step 
1461 has currently set or cleared the voice activity 
detection flag. If a determination is made at step 1462 
that the voice activity detection flag is set, the 

10 algorithm 1449 jumps to step 1463. At step 1463, the 

voice activity detector algorithm activates an automatic 
level control if other conditions are met. It further 
sends a speech payload to be packetized and updates the 
voice activity detection flag for external interaction 

15 with other blocks of the integrated telecommunication 

processor 150. The algorithm 1449 then proceeds to the 
next frame. If at step 1462 a determination has been 
made that the voice activity detection flag is cleared, 
the algorithm 1449 jumps to step 1464. At step 1464, the 

2 0 voice activity detector algorithm disables the automatic 
level control and causes a silence insertion description 
payload to be prepared. It further updates the silence 
insertion description payload and the voice activity 
detection flag for external interaction with the other 
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modules of the integrated telecommunications processor 
150 . 

Referring now to Figure 14C, the algorithm for the 
fast Fourier transform (FFT) processing 1451 of the input ' 
5 speech for the voice activity detector is illustrated. 
The FFT processing 1451 is used to find a tone signal as 
distinguished from speech or silence. After the framer 
1450 has framed the data, the fast Fourier transform 
processing 1451 can begin. At step 1470, an N point 

10 digital fast Fourier transform is performed. N can be 32 
or 64 or any other power of 2 . The digital fast Fourier 
transform at step 14 70 converts the time domain data into 
a given number of frequency bins for the given frame of 
data. The FFT processing 1451 then jumps to step 1472. 

15 At step 1472, the adjacent bins' squared values are added 
together and we get half the number of values (n/2) . The 
FFT processing then jumps to step 1474. At step 1474, a 
bin peak finder process is performed which finds 2 peaks 
of the sum of adjacent bins' squared values obtained in 

20 the previous step neglecting the zero frequency peak P0 . 
P0, if a tone is present (e.g. a signaling tone), will 
have a very high energy level. As discussed, bin 
magnitude calculator 1472 generates N divided by 2 values 
(N is the size of the FFT) . Of the N/2 values generated 
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by the bin magnitude calculator 1472, two peaks, PI and 
P2 (e.g. having the highest energy values), are selected 
by the bin peak finder 1474. Peaks PI and P2 , if speech 
is present, could represent the high energy level speech 
harmonics. The processing 1451 then jumps to step 1476. 

At step 14 76, the peak value difference is made with the 
peak threshold to determine if the fast Fourier transform 
flag should be set or cleared. At step 1476 the lOloglO 
difference between the zero frequency peak PO and the 
residual peak sum is generated. The residual peak sum is 
determined by summing all the bins determined in step 
1470 and by subtracting the two peak values PI and P2 
therefrom. Thus, the residual peak sum equals the sum of 
all bins - PI - P2 . The processing then jumps to step 
1478. At step 1478, the peak value difference is 
compared with a pre-determined peak threshold. If at 
step 1478 it is determined that the peak value difference 
is greater than or equal too the peak threshold, then the 
fast Fourier transform flag is set to one indicating a 
tone is present. Otherwise, the FFT flag is cleared and 
speech, silence, or other signals are assumed present. 

Referring now to Figure 14D, the flow diagram for 
the zero crossing detector 1452 is illustrated. After 
the fraraer 1450 frames the data input samples into a 
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frame, the zero crossing detection 1452 begins at step 
1480. At step 1480, the variable J, which is the sample 
number within the given frame, is initialized to zero. 
The zero crossing detector 1452 then jumps to step 1481. 
5 In step 1481, the frame length is considered with the 
variable J. If it is determined at step 1481 that the 
frame length is greater than J, then the zero crossing 
detector 1452 jumps to step 1482. If it is determined 
that the frame length is less than or equal to J at step 

10 1481, then the zero crossing detector jumps to step 1484 
which will be discussed later. At step 1482 the current 
data sample x[j] is multiplied together with the previous 
sample x[j-l] which is compared to zero to determine if 
there is a sign reversal between adjacent samples. If 

15 step 14 82 determines that there is no sign reversal 

between samples then the zero crossing detector returns 
to step 1481. If it is determined that there is a sign 
reversal between adjacent samples in step 14 82, the zero 
crossing detector jumps to step 1483. At step 1483, a 

2 0 running count of the zero crossings is incremented by one 
and the process performed by the zero crossing detector 
1452 goes back to step 1481. 

At step 1484, a Root Mean Squared value of zero 
crossing is determined by an equation. The Root Mean 
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Squared value of the zero crossing is given by the 
equation: RMS zero crossing equals alpha times zero 
crossing count + (1 -alpha ) times RMS zero crossing. 
Alpha is a fraction less than 1. The zero crossing 
5 detector process 1452 then continues to step 1485. At 
step 14 85 a determination is made whether the RMS zero 
crossing value is greater than a threshold value. If it 
is determined that the RMS zero crossing is greater than 
the threshold value zero crossing flag is set. Speech 

10 tends to have a high number of zero crossings. Thus, a 

greater number of zero crossings tends to indicate speech 
is present. If it is determined that the RMS zero 
crossing is less than or equal to the threshold value a 
zero crossing flag is cleared. Then the Zero Crossing 

15 Detector proceeds to the next frame. 

Referring now to Figure 14E, the flow chart of the 
process of the noise detection 1453 for the voice 
activity detector is illustrated. After the speech input 
samples are put in a frame by the framer 145 0, the noise 

2 0 detection process 1453 steps through two branches one at 
step 1488 and another at step 1489. At step 1488 the 
autocorrelation of the frame is determined through the 
equation r[0] . The equation r[0] is the summation J=0 to 
J=N-1 for the equation [x (j ) *x (n) ] . At step 1489 an 
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autocorrelation of the frame using a delay of 10 samples 
is made. Thus, a 10 th order correlation is made on this 
frame using the equation of block 1489. Autocorrelation 
for r [10] is the summation over J=0 to J=n-1 for the 
5 equation [x (n) *x (n- 10 ) ] . 

After completion of step 1488, the processes jumps 
to process 1490. In the step 1490, a root mean squared 
calculation of the autocorrelation r[0] is determined by 
the equation shown in block 1490. 

10 After the completion of the step 1489, the process 

jumps to 1491. At step 14 91, a root mean squared 
calculation of the other correlation r[10] is made 
through the equation shown in block 1491. 

After completing the steps 1490 and 1491, the noise 

15 detection process jumps to step 1492. At step 1492 a 
determination is made as to whether the root means 
squared of the autocorrelation r[0] (i.e. r[0]_RMS) of the 
frame multiplied by a correlation threshold is greater 
than the root means squared of the autocorrelation of the 

20 frame using a tenth delayed sample (i.e. r [10] _RMS) . 

That is, if the energy in the current frame multiplied by 
a constant is greater than the frame energy of a delayed 
sample (10 th order correlation) , then noise has been 
detected. If not, speech is most likely present because 
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the speech will generally have a high r[10]_RMS value due 
to the presence of harmonics and. because speech tends to 
be highly correlated. If step 1492 makes the 
determination that noise is present, the noise flag is 
5 set by the noise detection process 1453. If at step 1492 
the determination is made that noise was not present, the 
noise flag is cleared by the noise detection processes 
1453 . In either case the process continues by processing 
the next frame of data. 

10 Referring now to Figure 14F, the process of the 

energy discriminator 1454 is illustrated to determine the 
amount of energy present in a frame. After the data 
sample input x [n] is framed by the framer 1450 the energy 
discriminator starts at step 1494. At step 1494, the 

15 auto correlation of the frame of data r[0] is made. The 
equation for r[0] is illustrated in the block 1494. 
After completing the auto correlation of the frame in 
step 1494, the energy discriminator 1454 jumps to step 
1495. At step 1495, the logarithm of the autocorrelation 

2 0 of the frame is compared against an energy threshold. If 
it is determined that in step 1495 that the logarithm of 
the autocorrelation of the frame is greater than an 
energy threshold, the energy discriminator 1454 sets the 
energy flag to 1 and jumps to the next frame. Thus, if 
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the energy threshold, is met, then there is a greater 
likelihood speech is present. If the logarithm of the 
autocorrelation of the frame is less than the energy 
threshold, the energy flag is cleared, the energy 
5 discriminator goes to the next frame and at step 1496 the 
energy threshold is updated in the energy discriminator 
process 1454. The energy threshold is updated to keep 
track of background noise. Thus, this step updates the 
energy threshold only when the energy flag is found to be 

10 set to zero. 

Referring now to Figure 14G, a flow diagram of the 
process for the instantaneous energy discriminator 1455 
is illustrated. After the speech input samples are 
framed by the framer 1450. The steps 1465 and 1466 are 

15 begun in parallel within the instantaneous energy 

discriminator 1455. At step 1465, the autocorrelation of 
the frame is determined by the equation in block 1465. 
Additionally, the previous autocorrelation calculation 
(i.e. prevR[0] ) is updated, which means that the process 

20 updates the stored previous frame's r[0] value by the 

current frame's r[0]. After step 1465 is completed the 
instantaneous energy level process jumps to step 1469. 
At step 1466, the autocorrelation of the frame is made 
using a 10 th order delayed sample as shown by the equation 
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illustrated in block 1466. Thus, a 10 th order correlation 
is made using this equation. After correlation of the 
tenth sample, the process for the instantaneous energy 
discriminator jumps to step 1467. At step 1467 the root 
means squared calculation from the 10th sample is made. 
Additionally the previous root means squared calculation 
of the correlation r[10] is updated. After completing 
step 1467 the process jumps to step 1468. 

At step 1468, the instantaneous energy discriminator 
process determines a difference between root means 
squared value of the auto correlation of the current 
frames tenth delayed sample from the root means squared 
value of the auto correlation of the previous frames 
tenth delayed sample by the equation in the block 1468. 
After completion of the calculation in step 1468, 
instantaneous energy discriminator process 1455 jumps to 
step 1469. 

At step 1469, a determination is made as to whether 
the value of the difference between the current frames 
energy at the autocorrelation of the tenth sample and the 
prior frames energy at the autocorrelation of the tenth 
sample is greater than the previous frames 

autocorrelation multiplied by a starting threshold by the 
equation shown in figure 14G at block 1469. If it is 
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determined that this difference is greater than the 
previous frames autocorrelation multiplied by the 
starting threshold, then the instantaneous flag is set 
and the instantaneous energy discriminator 1455 jumps to 
process the next frame. The difference rlO corresponds 
to the difference of higher-order harmonics 
(representative of speech) between two consecutive frames 
(possibly of speech) . Thus, if this value is greater 
than the previous frame's autocorrelation multiplied by a 
starting threshold it more likely represents a speakers 
change in pitch (e.g. a speaker goes from talking at a 
normal voice to high pitched voice) rather than an 
instantaneous burst of noise . 

Otherwise, if in step 1469 it is determined that 
the difference is less than or equal to the previous 
frames autocorrelation multiplied by the starting 
threshold, then the instantaneous energy discriminator 
clears the instantaneous flag and goes on to process the 
next frame of data. 

Returning again to Figure 14A, the ALC 1401 reads 
the voice_activity flag and applies gain control if voice 
is detected. Otherwise if the voice_activity flag 
indicates silence, the ALC 1401 does not apply gain and 
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passes Sout through without amplitude change as its 
output . 

The packetizer/encoder 1409 reads the voice activity 
flag to determine if a current frame of data contains a 
5 valid voice signal or not. If the current frame is 
voice, then the output from the ALC needs to be added 
into the PCM payload. If the current frame is silence 
and an SID has been generated by the VAD 14 05, the 
packetizer/encoder 1049 reads the SID information stored 

10 in the shared memory in order for it to be packetized. 

The ALC 1401 functions in response to the VAD 1405. 
The VAD 14 0 5 may look over the last one or more frames 
of data to determine whether or not the ALC information 
should be added to a frame or not . 

15 The ALC 1401 applies gain control if voice is detected 

else Sout is passed through without any change. The tone 
detector 1404 disables and enables the ALC 1401 as 
described above to comply with the G.169 specification. 
Additionally, the ALC 14 01 is disabled when Sout signal 

20 level goes below certain threshold (-40 dBm for example) 
after Echo Cancellation by the echo canceller 1403. If 
current frame contains valid voice data, then the output 
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gain information from the ALC 1401 is added to the PCM 
payload by the packetizer. Otherwise if silence is 
detected, the packetizer uses the SID information to 
generate packets to be sent as the send_packets . 

The DTMF detector 14 06 functions in response to the 
output from the ALC 1401. The DTMF detector 1406 uses 
an internal frame size of 102 data samples but it adapts 
to any frame size of data samples. DTMF signaling events 
for a current frame are recorded in an InterFB area of 
shared memory. High level programs use DTMF signaling 
events stored in the InterFB area. Typically the high 
level program reads all the necessary info and then 
clears the contents for future use. 

The DTMF detector 14 06 may read the VAD_activity 
flag to determine if voice signals are detected. If so, 
the DTMF detector may not execute until other signal 
types, such as tones, are detected. If the DTMF detector 
detects that a current frame of data contains valid DTMF 
digits, then a special DTMF payload is generated for the 
packetizer. The special DTMF payload contains relevant 
information needed to faithfully regenerate DTMF digits 
at the other end. The packetizer/encoder generates DTMF 
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packets for transmission over the send_packet output. 

The Packet izer /Encoder 1409 includes a packet header 
of 1 byte to indicate which data type is being carried in 
the payload. The payload format depends on the data being 
5 transported. For example, if the payload contains PCM 
data then the packet will be quite larger than an SID 
packet for generating comfort noise. The packetizing may 
be implemented as part of the integrated 

telecommunications processor or it may be performed by an 

10 external network processor. 

The Depacketizer/Decoder 1410 receives a stream of 
packets over rx_packet and first determines what type of 
packet it is by looking at the packet header. After 
making a determination as to the type of packet received, 

15 the appropriate decoding algorithm can be executed by the 
integrated telecommunications processor. The type of 
packets and their possible decoding functions include 
Comfort Noise Generation (CNG) , DTMF Generation, and 
PCM/Voice decoding. The Depacketizer/Decoder 1410 

2 0 generates frames of data which are used as Rin. In many 
cases, a single frame of data is generated by one packet 
of data. 
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The comfort noise generator (CNG) 142 0 receives 
commands from the depacketizer/decoder 1410 to generates 
a "comfortable" pink noise in response receiving an SID 
frame as a payload in a packet on the rx_packet . The 
comfort noise generator (CNG) 142 0 generates the 
"comfortable" pink noise at a level corresponding to the 
noise power indicated in the SID frame. In general, the 
comfort noise generated can have any spectral 
characteristics and is not limited to pink noise. 

The DTMF Generator 14 0 7 receives commands from the 
depacketizer and generates DTMF tones in response to the 
depacketizer receiving a DTMF payload in a packet on 
rx packet . The DTMF tones generated by the DTMF 
Generator 1407 correspond to amplitude levels, key, and 
possibly duration of the corresponding DTMF digit 
described in the DTMF payload. 

Referring now to Figure 15, exemplary memory maps of 
the memories of the integrated telecommunications 
processor 150 and their inter-relationship are 
illustrated. Figure 15 illustrates an exemplary memory 
map for the global buffer memory 210 to which each of the 
core processors 200 have access. The program memory 204 
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and the data memory 2 02 for each of four core processors 
200A-200D (Core 0 to Core 3) is also illustrated in 
Figure 15 as being stacked upon each other. The program 
memory 204C and the data memory 202C for the core 
processor 2 00C (Core 2) is expanded in Figure 15 to show 
an exemplary memory map. Figure 15 also illustrates the 
file registers 413 for one of the core processors, core 
processor 200C (Core 2) . 

The memory of the integrated telecommunications 
processor 150 provides for flexibility in how each 
communication channel is processed. Firmware and data 
can be swapped in and out of the core processors 2 00 when 
processing a different job. Each job can vary by 
channel, by frame, by data blocks or otherwise with 
changes to the firmware. In one embodiment, each job is 
described for a given frame and a given channel . By 
providing the functionality in firmware and swapping the 
code into and out of program memory of the core 
processors 200, the functionality of the integrated 
telecommunications processor 150 can be easily modified 
and upgraded. 

Figure 15 also illustrates the interrelationship 
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between the global buffer memory 210, data memory 2 02 for 
the core processors 200, and the register files 413 in 
the signal processing units 3 00 of each core processor 
200. The multichannel memory movement engine 208 
flexibly and efficiently manages the memory mapping so as 
to extract the maximum efficiency out of each of the 
algorithm signal processors 3 00 for a scalable number of 
channels. That is, the integrated telecommunications 
processor 15 0 can support a varying number of 
communication channels which is scalable by adding 
additional core processors because the signal processing 
algorithms and data are stored in memory are easily 
swapped into and out of many core processors . 
Furthermore, the memory movement engine 208 can sequence 
through different signal processing algorithms to provide 
differing module functionality for each channel. 

All algorithm data and code segments are completely 
relocatable in any memory space in which they are stored. 

This allows processing of each frame of data to be 
completely independent from the processing of any other 
frame of data for the same channel. In fact, any frame 
of data may be processed on any available signal 
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processor 300. This allows maximum utilization of the 
processor resources at all times. 

Frame processing can be partitioned into several 
pieces corresponding to algorithm specific functional 
blocks such as those for the integrated 

telecommunications processor illustrated in Figures 11- 
14. The "fixed" (non- changing) code and data segments 
associated with each of these functional blocks can be 
independently located in a memory space which is not 
fixed and only one copy of these segments need be kept 
regardless of the number of channels which are to be 
supported. This data can be downloaded and/or upgraded 
at any time prior to it's use. A table of pointers, for 
example, can be used to specify where each of these 
blocks currently resides in a memory space. In addition, 
dynamic data spaces required by the algorithms, which are 
modifiable, can be allocated at run- time and de-allocated 
when no longer needed. 

When a frame (s) for a particular channel is ready 
for processing, only the code and data for .the functional 
blocks required for the specified processing of the frame 
need be referenced. A "script" specifying which of these 
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functional blocks is required can be constructed in real 
time on a frame by frame basis. Alternately, pre- 
existing scripts which contain functional block 
references identified by an identifier for example can be 
5 called and executed without addresses . In this case the 
locations of the functional blocks in any memory space 
are "looked" up from a table of pointers, for example. 

Furthermore, DMA can be utilized if the code and/or 
data segments for a functional block must be transferred 

10 from one memory space to another memory space in order to 
reduce the overhead associated with processor 
intervention in such transfer. Since the code and data 
blocks required by any functional block are completely 
independent of each other, "chains" of DMA transfers can 

15 be defined and executed to transfer multiple blocks from 
one memory space to another without processor 
intervention. These "chains" can be created or updated 
when needed based on the current processing requirements 
for a particular channel using the "catalog" of 

20 functional blocks currently available. A DMA module 

creating a description of DMA transfers can optimize the 
use of the destination memory space by locating the 
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segments wherever necessary to minimize wasted space. 

In Figure 15, functional blocks and channel specific 
segments are arranged in the memory spaces of the global 
buffer memory 210 and called into the data memory 2 02 and 
5 program memory 204 of a core processor 200. In the 

exemplary illustration of Figure 15, the Global buffer 
memory 210 includes an Algorithm Processing (AP) Catalog 
1500, Dynamic Data Blocks 1515, Frame Data Buffers 152 0, 
Functional-Block (FB) & Script Header Tables 1525, 

10 Channel Control Structures 1530, DMA Descriptors List 
1535, and a Channel Execution Queue 1540. 

Figure 16 is a block diagram illustrating another 
exemplary memory map for the global buffer memory 210 of 
the integrated telecommunications processor 150 and the 

15 inter-relationship of the blocks contained therein. 

Referring to Figures 15 and 16, the Algorithm 
Processing (AP) Catalog 1500 includes channel 
independent, algorithm specific constant data segments, 
code data segments and parameter data segments for any 

2 0 algorithm which may be required in the integrated 

telecommunications processor system. These algorithms 
include telecommunication modules for Echo cancellation 

042390. P12534 104 
Express Mail: EL802874811US 



(EC) , tone detection and generation (TD) , DTMF detection 
and generation (DTMF), G . 7xx CODECS, and other functional 
modules. Examples of the code data segments include DTMF 
code 1501, TD code 1502, and EC code 1503 for the DTMF, 
5 TD and EC algorithms respectively. Examples of the 

algorithm specific constant data segments include DTMF 
constants 1504, TD constants 1505, and EC constants 1506 
for the DTMF, TD and EC algorithms respectively. 
Examples of the parameter data segments include DTMF 

10 parameters 1507, TD parameters 1508, and EC parameters 
1509 for the DTMF, TD and EC algorithms respectively. 

The Algorithm Processing (AP) Catalog 1500 also 
includes a set of scripts (each containing a script data, 
script code, and a script DMA template) for each kind of 

15 frame processing required by the system. The same script 
may be used for multiple channels, if these channels all 
require the same processing. The scripts do not contain 
any channel specific information. Figure 15 illustrates 
script 1 data 1511A, script 1 code 1512A, and a script 1 

20 DMA template 1513A through script N data 1511N, script N 
code 1512N, and script N DMA template 1513N. 

The script 1 blocks (script 1 data 1511A, script 1 
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code 1512A, script 1 DMA template 1513A) in the AP 
catalog 1500 define the functional blocks required to 
accomplish specific processing of a frame of data of a 
any channel which requires the processing defined by this 
5 script and the addresses into the program memory 2 04 

where the functional block code should be transferred and 
the data memory 2 02 where the data segments should be 
transferred. Alternately, these addresses into the 
program memory 2 04 and data memory 2 02 where the data 

10 segments should be transferred could be determined at run 
time by a core memory management function. The script 1 
blocks also specify the order of execution of the 
functional blocks by one of the core processors 200. The 
script 1 code 1512A for example may define the functional 

15 blocks and order of execution required to accomplish echo 
cancellation and DTMF detection. Alternately, it could 
describe the functional blocks and execution required to 
perform G . 7xx coding and decoding. Note also that the 
script 1 blocks can specify "conditional" data transfer 

2 0 and execution such as a data transfer or an execution 

which depends on the result of another functional blocks 
results. For example these conditional data transfers may 
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include those surrounding the functional blocks such as 
whether or not call progress tones are detected. The 
script 1 DMA template 1513A associated with the script 1 
blocks specifies the sequence in which the data should be 
5 transferred into and out of the data memory and program 
memory of one of the core processors 200. Additionally, 
the script DMA templates associated with each script 
block is used to construct the one or more channel 
specific DMA descriptors in the DMA descriptors list 1535 

10 in the global memory buffer 210. 

The global buffer memory 210 also includes a table 
of Functional Block and Script Headers referred to as the 
FB and Script Header tables 1525. The FB and Script 
Headers tables 1525 includes the size and the global 

15 buffer memory starting addresses for each of the 

functional blocks segments and script segments contained 
in the AP Catalog 1500. For example referring to Figure 
16, the DTMF header table includes the size and starting 
addresses for the DTMF code 1501, the DTMF constants 1504 

20 and the DTMF parameters 1507. A script 1 header table 

includes the size and starting addresses for the script 1 
data 1511A, the script 1 code 1512A, and the script 1 DMA 

042390 . P12534 107 
Express Mail: EL802874811US 



template 1513A. FB and Script Headers table 1525 in 
essence points to these blocks in the AP catalog 1500 
including others such as the EC Code 1503, the EC 
constants 1506 and the EC Parameters 1509. The contents 
5 of FB and Script Header tables 1525is updated whenever a 
new AP catalog 1500 is loaded or an existing AP catalog 
1500 is updated in the global buffer memory 210 . 

The global buffer memory also has channel specific 
data segments consisting of dynamic data blocks 1515 and 

10 frame data buffers 1520. The dynamic data blocks 1515 
illustrated in the exemplary map of Figure 15 includes 
the dynamic data blocks for channels n (CHn) through 
channel p (CHp) . The type of dynamic data blocks for 
each channel corresponds to the functional modules used 

15 in each channel. For example as illustrated in Figure 

15, channel n has EC dynamic data blocks, TD dynamic data 
blocks, DTMF dynamic data blocks, and G . 7xxx codec 
dynamic data blocks. In Figure 16, the dynamic data 
blocks required for channel 10 are chlO-DTMF, chlO-EC and 

2 0 chlO-TD, required for channel 102 are Chi 02 -EC and chl02- 
G.7xx, and required for channel 86 is Ch86-EC. 

The frame data buffers 152 0 include channel specific 
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data segments for each channel for the far in data, far 
out data, near in data and near out data. The near in 
data and near out data are for the PSTN network side 
while the far in data and the far out data are for the 
5 packet network side. Note that n channels may be 
supported such that there may be n sets of channel 
specific dynamic data segments and n sets of channel 
specific frame buffer data segments. In Figure 16, the 
channel specific frame data segments include chlO-Near In 

10 data, chlO-Near Out data, chlO-Far In data, chlO-Far Out 
data, chl02-Near In, chl02-Far In, chl02-Near Out and 
chl02-Far Out in the frame data buffers 1520. The 
channel specific data segments and the channel specific 
frame data segments allows the integrated 

15 telecommunications processor 150 to process a wide 
variety of communication channels having differing 
parameters at the same time. 

The set of channel control structures 1530 in the 
global buffer memory 210 includes all information 

20 required to process the data for a particular channel. 
This information includes the channel endpoints (e.g. 
source and destination of TDM data, source and 
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destination of packet data) , a description of the 
processing required (e.g. Echo cancellation, VAD, DTMF, 
Tone detection, coding, decoding, etc , to use) . It also 
contains pointers to locate the data resources required 
5 for processing (e.g. the script, the dynamic data blocks, 
the DMA descriptor list, the TDM (near in and near out) 
buffers, and the packet data (far in and far out) 
buffers) . Statistics regarding the channel are also 
maintained in the channel control structure. This 

10 includes such things as the # of frames processed, the 
channel state (e.g. Call setup, fax/voice/data mode, 
etc), bad frames received, etc) . In Figure 16, the 
channel control structures include channel control 
structures for channel 10 and channel 102 each of which 

15 point to respective dynamic data blocks 1515 and frame 
data buffers 1520. 

The DMA Descriptor lists 1535 in the global buffer 
memory 210 defines the source address, destination 
address, and size for every data transfer required 

2 0 between the Global buffer memory 210 and the program 

memory 2 04 and data memory 2 02 for processing the data of 
a specific channel. Thus, n sets of DMA descriptor lists 
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exist for processing n channels. Figure 15 illustrates 
the DMA descriptors list 1535 as including CHm DMA 
descriptors list through CHn DMA descriptors list. In 
Figure 16, the DMA Descriptor Lists 1535 includes CH 10 - 
5 DMA descriptors and CH 102 - DMA descriptors. 

The global buffer memory 210 further has a Channel 
Execution Queue 1540. The Channel Execution Queue 1540 
schedules and monitors processing jobs for all the core 
processors 200 of the integrated telecommunications 

10 processor 150. For example, when a frame of data for a 
particular channel is ready to be processed, a 
"management function" creates or updates the DMA 
descriptor list for that channel based on the Script and 
block addresses found in the FB headers of the FBH table 

15 152 5 and/or channel control structure found in the script 
block 1530. The job is then scheduled for processing by 
the Channel Execution Queue 1540. The DMA descriptor 
list 1535 includes the transfer of the script itself from 
the global buffer memory 210 to the data memory 2 02 and 

2 0 program memory 2 04 of the core processor 2 00 that will 
process that job. Note that the core addresses are 
specified in such a way that they are applicable to ANY 
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core which may process the job. The same DMA descriptor 
list may be used to transfer data to any one of the cores 
in the system. In this way, all necessary information to 
process a frame of data can be constructed ahead of time, 
5 and any core which may then become available can perform 
the processing. 

Consider the scheduled job 1 in the session 
execution queue 154 0 of Figure 16, for example. 
Scheduled job 1 points to the Ch 10 - DMA descriptors in 

10 the DMA Descriptor list 1535 for frame 40 of channel 10. 
The scheduled job n points to the Ch 102 - DMA 
descriptors in the DMA Descriptor list 1535 to process 
frame 106 of channel 102. 

The upper portion of the program memory 204C and 

15 data memory 2 02C illustrates an example of the program 
memory 2 04C including script code 155 0, DTMF code 1551 
for the DTMF generation and detection, and EC code 1552 
for the echo cancellation module. The code stored in the 
program memory 204 varies depending upon the needs of a 

20 given communication channel. In one embodiment, the code 
stored in the program memory 2 04 is swapped each time a 
new communication channel is processed by each core 
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processor 20 0. In another embodiment, only the code that 
needs to be swapped out, removed or added in the program 
memory 204 each time a new communication channel is 
processed by each core processor 2 00. 

The lower portion of the program memory 2 04C and 
data memory 202C illustrates the data memory 202C which 
includes script data 1560, interf unct ional block data 
area 1561, DTMF constants 1504, DTMF Parameters 1507, CHn 
DTMF dynamic data 1562, EC constants 1506, EC Parameters 
1509, CHn EC dynamic data 1563, CHn Near In Frame Data 
1564, CHn Near Out Frame Data 1566, CHn Far In Frame Data 
1568, and CHn Far Out Frame Data 1570, and other 
information for additional functionality or additional 
functional telecommunications modules. These constants, 
variables, and parameters (i.e. data) stored in the data 
memory 2 02 varies depending upon the needs of a given 
communication channel. In one embodiment, the data 
stored in the data memory 2 02 is swapped each time a new 
communication channel is processed by each core processor 
2 00. In another embodiment, only the data that needs to 
be swapped out, removed or added into the data memory 202 
each time a new communication channel is processed by 
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each core processor 200. 

Figure 15 illustrates the Register File 413 for the 
core processor 200A (core 0) . The register file 413 
includes a serial port address map for the serial port 
5 206 of the integrated telecommunications processor 150, a 
host port address map for the host port 214 of the 
integrated telecommunications processor 150, core 
processor 2 00A interrupt registers including DMA pointer 
address, DMA starting address, DMA stop address, DMA 
10 suspend address, DMA resume address, DMA status register, 
and a software interrupt register, and a semaphore 
address register. Jobs in the channel execution queue 
1540 load the DMA pointer in the file registers 412 of 
the core processor. 

15 Figure 17 is an exemplary time line diagram of 

processing frames of data. The integrated 

telecommunications processor processes multiple frames of 
multiple channels. The time required to process a frame 
of data for any particular channel is in most cases much 
20 shorter than the time interval to receive the next 

complete frame of data. The time line diagram of Figure 
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17 illustrates two frames of data for a given channel, 
Frame X and Frame X+l, each requiring about twelve units 
of time to receive. The frame processing time is 
typically shorter and is illustrated in Figure 17 for 
example as requiring two units each to process Frame X 
and Frame X+l. For the same channel it can be expected 
that the processing time for each frame is similar. Note 
that there is about ten units of delay time between the 
completion of processing of Frame X and the start of 
processing of Frame X+l. It would be an inefficient use 
of resources for a processor to sit idle during this 
delay time between received frames waiting for a new 
frame of data to be received in order to start 
processing . 

To avoid inefficiencies, the integrated 
telecommunications processor 150 processes jobs for other 
channels and their respective frames of data instead of 
sitting idle between frames for one given channel . The 
integrated telecommunications processor 150 processes 
jobs which are completely channel and frame independent 
as opposed to processing one or more dedicated channels 
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and their respective frames. Each frame of data for any 
given channel can be processed on any available core 
processor 200. 

Referring now to Figure 18, an exemplary time line 
diagram of how one or more core processors 200A-200N of 
the integrated telecommunications processor 150 processes 
jobs on frames of data for multiple communication 
channels. The arrows 1801A-18 01E in Figure 18 represent 
jobs or idle time for the core processor 1 200A. The 
arrows 18 02A-18 02D represent jobs or idle time for the 
core processor 2 200B. The arrows 1803A-1803E represent 
jobs or idle time for the core processor N 2 0 ON. Arrows 
1801D and 1803C illustrated idle time for core processor 
1 and core processor N respectively. Idle times occur 
for a core processor only when there is no data available 
for processing on any currently active channel. The 
Ch### nomenclature above the arrows refers to the channel 
identifier of the job that is being processed over that 
time period by a given core processor 200. The Fr### 
nomenclature above the arrows refers to the frame 
identifier for the respective channel of the job that is 
being processed over that time period by the given core 
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processor 200. 

The jobs, including a job description, are stored in 
the channel execution queue 154 0 in the global buffer 
memory 210. In one embodiment of the present invention, 
5 all channel specific information is stored in the Channel 
Control Structure, and all required information for 
processing the job is contained in the (channel 
independent) script code and script data, and the 
(channel dependent) DMA. descriptor list which is 

10 constructed prior to scheduling the job. The job 
description stored in the channel execution queue, 
therefore, need only contain a pointer to the DMA 
descriptor list. 

Core processor 2 0 OA, for example, processes job 

15 1801A, job 1801B, job 1801C, waits during idle 1801D, and 
processes job 1801E. The arrow or job 1801A is a job 
which is performed by core processor 1 2 00A on the data 
of frame 10 of channel 5. The arrow or job 1801B is a 
job on the data of frame 2 of channel 40 by the core 

20 processor 1 200A. The arrow or job 1801C is a job on the 
data of frame 102 of channel 0 by the core processor 1 
200A. The arrow or job 1801E is a job on the data of 
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frame 11 of channel 87 by the core processor 1 200A. 
Note that core processor 1 200A is idle for a short 
period of time during arrow or idle 18 0 ID and otherwise 
use to process multiple jobs. 
5 Thus, Figure 18 illustrates an example of how job 

processing of frames of multiple telecommunication 
channels can be distributed across multiple core 
processors 2 00 over time in one embodiment of the 
integrated telecommunications processor 150. 

10 Because jobs are processed in this manner, the 

number of channels supportable by the integrated 
telecommunications processor 150 is scalable. The 
greater the number of core processors 200 available in 
the integrated telecommunications processor 150 the more 

15 channels that can be supported. The greater the 

processing power (speed) of each core processor 150, the 
greater the number of channels that can be supported. 
The processing power in each core processor 20 0 may be 
increased for example such as by faster hardware (faster 

20 transistors such as by narrower channel lengths) or 
improved software algorithms. 

As those of ordinary skill will recognize, the 
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present invention has many advantages. One advantage of 
the present invention is that telephony processing is 
integrated into one processor. Another advantage of the 
present invention is that improved telephone 
5 communication channels are provided between a time 
division multiplexed (TDM) telephone network and a 
packet i zed network. Another advantage of the present 
invention is that all the telecommunications modules 
couple together as a unit and the interrelationships 

10 among different modules can then be exploited. As a 

result, the present invention enables aggregating a large 
number of TDM channels by providing all Telephony 
functions, compression, decompression and transceiving as 
separate packet channels over a packet network. The 

15 control mechanism of the present invention can process 

the data inputs and outputs of different TDM channels and 
sequence them efficiently for channel based signal 
processing in the hardware. 

The preferred embodiments of the present invention 
20 are thus described. While the present invention has been 
described in particular embodiments, it may be 
implemented in hardware, software, firmware or a 
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combination thereof and utilized in systems, subsystems, 
components or sub -components thereof. When implemented 
in software, the elements of the present invention are 
essentially the code segments to perform the necessary 
5 tasks. The program or code segments can be stored in a 
processor readable medium or transmitted by a computer 
data signal embodied in a carrier wave over a 
transmission medium or communication link. The 
"processor readable medium" may include any medium that 

10 can store or transfer information. Examples of the 

processor readable medium include an electronic circuit, 
a semiconductor memory device, a ROM, a flash memory, an 
erasable ROM (EROM) , a floppy diskette, a CD-ROM, an 
optical disk, a hard disk, a fiber optic medium, a radio 

15 frequency (RF) link, etc. The computer data signal may 

include any signal that can propagate over a transmission 
medium such as electronic network channels, optical 
fibers, air, electromagnetic, RF links, etc. The code 
segments may be downloaded via computer networks such as 

2 0 the Internet, Intranet, etc. In any case, the present 
invention should not be construed as limited by such 
embodiments, but rather construed according to the 
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claims . 
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